This Python package is used for generating HTML reports about the contents of Esri geodatabases. It supports working with Esri personal/file geodatabases as well as RDBMS enterprise geodatabases (however, it also works with RDBMS databases that were not geodatabase enabled).
Example of report:
In order to use this tool, you would need to have either:
- ArcGIS Desktop, ArcGIS Server or ArcGIS Pro installed (for
arcpy
package). You will be able to run the tool with ArcGIS Desktop/Server Python 2.7 as well as ArcGIS Pro Python 3.5+. - GDAL (for
ogr
package). This means you will be able to run the tool without having any Esri software installed. However, only file geodatabases are supported for reporting with GDAL.
The code written is a valid Python 2 as well as Python 3 code, so using the tool in both environments is supported. You will need pandas
and beatifulSoup
Python packages which can be installed from pip
if you are ArcGIS Desktop/Server user or using conda
if you are ArcGIS Pro or Anaconda user. See Installing Packages in the Python documentation (for ArcGIS Desktop users) and Install a package in the Conda documentation (for ArcGIS Pro and Anaconda users) to get help.
If the package is able to import arcpy
, then it will use arcpy
because it provides a more complete view into your geodatabase. The most time is spent in arcpy
describe and listing functions iterating the datasets and pulling all the relevant information.
If arcpy
is not found in your system, then the package will try to import ogr
. The OpenFileGDB
driver is used to access the key information about the geodatabase and its items. Please keep in mind that when using ogr
, no information about tables/feature classes indexes and subtypes will be reported and many of the ArcGIS specific properties won't be shown either.
After HTML file is generated, you can open it in a web browser. The HTML file uses resources in other folders located in the app
folder which means you cannot move the HTML file somewhere else without copying its dependency folders. If you would like to move the report somewhere else, you need to copy the whole app
folder. While having the HTML file shown in a web browser, you can use the links to navigate throughout the page. Keep in mind that since navigation is built using the headers id, you can use the web browser's tools for navigating forward and back as you navigate between different sections of the report.
- arcpy | GDAL 2.1.0
- Python 2.7 | 3.5
- pandas >= 0.20.1
- beautifulsoup4 >= 4.6.0
python setup.py install
Please keep in mind that arcpy
and GDAL
won't be installed when installing the package. The arcpy
package can be obtained by installing ArcGIS software and GDAL can be installed in many ways, for instance, using Anaconda Navigator or using a binary wheel.
When calling the reporting function, you'd need to supply path to your geodatabase and output folder for report data files, and optionally specify what information you would like to include into your output HTML report. Depending on the geodatabase size, number of chosen items to include in the report as well as the available system resources, it might take some minutes to run.
The package has convenience functions which can be used to specify what exactly do you want to generate report for. The gdb2html
function will create a complete report. However, you can specify if you want to report only domains, only tables, or only feature classes using the domains2html
, tables2html
, and fcs2html
functions respectively. If you would like to have a fine-grained control over what information will be included in the report, you can use report_gdb_as_html
function which has a few booleans you can set (for instance, you may want to get only list of tables and feature classes without reporting their fields, subtypes, and indexes).
To generate full geodatabase report:
import registrant
registrant.gdb2html(r"C:\GIS\Production.gdb", r"C:\GIS\ReportFolder")
To generate report listing only tables and feature classes (with no information on fields, subtypes, and indexes):
import registrant
registrant.report_gdb_as_html(r"C:\GIS\Production.gdb",
r"C:\GIS\ReportFolderTablesFcs",
do_report_domains=False,
do_report_domains_coded_values=False,
do_report_tables=True,
do_report_tables_fields=False,
do_report_tables_subtypes=False,
do_report_tables_indexes=False,
do_report_fcs=True,
do_report_fcs_fields=False,
do_report_fcs_subtypes=False,
do_report_fcs_indexes=False)
To generate report listing only domains and coded values for domains:
import registrant
registrant.domains2html(r"C:\GIS\Production.gdb", r"C:\GIS\ReportFolderDomains")
This tool uses arcpy
package (and if you don't have any ArcGIS software installed - ogr
package) to read properties of geodatabase into Python dictionaries which are used then to construct pandas
data frames. The data frames are exported into HTML tables (as large strings) using built-in pandas.DataFrame.to_html
method. To merge all the HTML tables into a single page, beatifulSoup
package is used. The HTML report page is built using the Bootstrap 3 Dashboard sample. Some extra functionality is added with the help of Bootstrap 3 DataTables extension. Additional navigation items in the table of contents are added to the HTML page on-the-fly while reading geodatabase tables and feature classes.
-
Why not just use ArcGIS Diagrammer to generate HTML reports?
ArcGIS Diagrammer was deprecated at ArcGIS 10.3 which means only users of ArcGIS 10.0-10.2 can use it. If you have ArcGIS 10.3 or above, you can still install it on your machine, but you would need to have the .dll files of ArcObjects SDK for .NET of version 10.2 available.
The
registrant
package tool is not a replacement of ArcGIS Diagrammer as it can only report the information about the geodatabase contents (you cannot design a new geodatabase). However, this tool reports a lot of useful information about the contents of geodatabase as ArcGIS Diagrammer does. This means that you might not need to install ArcGIS Diagrammer if you are only interested in getting the HTML reports about the geodatabase contents. -
I would like to see the information about
X
being reported, but the HTML report doesn't provide any information about it. Can you please add it?Feel free to post an issue in this repository and I will see what's possible.
-
Why does it take less time for ArcGIS Diagrammer to generate HTML report about my geodatabase?
ArcGIS Diagrammer was built using fine-grained
ArcObjects
libraries which provides low-level access to the underlying geodatabase datasets and their properties. With this Python package, reading the geodatabase is done usingarcpy
Python package which makes it really easy to write and maintain this package, however there is a price to pay for that; it takes extra time to convertArcObjects
COM objects to Python objects and vice versa which results in longer processing time. Reading properties of geodatabase datasets withogr
happens much faster than when usingarcpy
, but you don't get all the properties exposed via this interface.If you need to produce HTML reports fairly often and performance does matter to you, try to leave out some of the report options to save time. You could, for instance, skip reporting indexes and subtypes for every table and feature class if you have a large number of datasets in your geodatabase.
- General geodatabase overview
- Domains & coded values
- Tables & Feature classes
- Table & Feature class fields
- Table & Feature class subtypes (
arcpy
only) - Table & Feature class indexes (
arcpy
only)
All fields in tables and feature classes have a property showing the order of the field (UI order
) within the dataset as shown in the dataset properties window in ArcGIS Desktop or Pro that is the order in which the fields were added. This is the order in which fields appear when you open the Feature Class Properties or Table Properties window in ArcGIS Desktop or when you access the Fields window in ArcGIS Pro.
As a note, Unicode characters are supported in geodatabase table names, field aliases and so forth. The web page should be drawn using the utf-8
encoding. Should any characters appear strange, make sure you are viewing the report page in the proper encoding:
- Firefox:
View > Text Encoding > Unicode
- IE:
View > Encoding > Unicode (UTF-8)
- Chrome: auto-detect should do the magic
-
Navigation panel to the left lets you quickly jump between different report sections.
-
All columns are sortable (ASC | DESC) with the option to choose a number of entries to show for every data table and use paging. Every section also has the Search panel for text search within the section (e.g., search for the field name in a particular feature class) that will filter out table rows with no matches.
-
All feature classes have a single default subtype defined for them, so to save the space the information about subtypes will be reported only when there are at least two subtypes (which means at least one subtype has been added by the user).
-
By using the
Print
link one can create a printable representation of the report; opening printable report can take some time depending on the number of items in your geodatabase. The report page will be visible in the new browser tababout:blank
. The generation of printable report works best in Firefox. Please mind that Chrome and IE can choke when the report content is very large.
- Reorder columns within the data tables https://datatables.net/extensions/colreorder/
- Add mapping for
arcpy.Field
data types and ArcGIS Desktop (String - Text etc)
-
Cd to the
tests
folder and runcoverage run -m unittest discover
. This will create a.coverage
file which contains the metadata about code coverage. -
Cd to the
tests
folder and runcoverage html -d coverage_html --omit "C:\Program Files (x86)\ArcGIS\Desktop10.5\*"
. This will generate a nice.html
report highlighting the covered code. The--omit
flag is used to exclude calls toarcpy
in the report.
Did you find a bug? Do you need report to include some other information? Please let me know by submitting an issue.
MIT