EIIG automatically builds a Data Catalog, an inventory of all data assets in the organization, from the metadata it captures when its ingestors connect to the firm’s systems and tools that produce, transmit, store, or allow consumption of data within the corporate data landscape. EIIG provides a search capability for the Data Catalog so users can quickly find the data relevant to their needs.
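As a rough illustration of what a searchable inventory of captured metadata can look like, the sketch below models catalog entries and a simple keyword search. It is not EIIG's data model or API: the `Asset` class, its fields, the `search` function, and the sample entries are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Asset:
    """One catalog entry; every field name here is hypothetical."""
    system: str
    schema: str
    table: str
    columns: tuple = ()
    tags: tuple = ()


def search(catalog, query):
    """Return assets whose table, column, or tag names contain the query."""
    needle = query.lower()
    return [
        asset for asset in catalog
        if any(
            needle in text.lower()
            for text in (asset.table, *asset.columns, *asset.tags)
        )
    ]


# Made-up sample metadata, standing in for what ingestors might capture.
catalog = [
    Asset("crm", "sales", "customers", ("customer_id", "email"), ("pii",)),
    Asset("erp", "finance", "invoices", ("invoice_id", "customer_id", "amount")),
    Asset("lake", "raw", "clickstream", ("session_id", "url")),
]

hits = search(catalog, "customer")
print([f"{a.system}.{a.schema}.{a.table}" for a in hits])
```

Matching on column and tag names as well as table names lets a user find a dataset without knowing what the table is called.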

When organized as well as EIIG organizes it, this comprehensive and searchable metadata set becomes the go-to place for users to find, expand, and collaborate on their data analytics efforts, empowering them to drive or accelerate business initiatives, performance, and execution. A data catalog allows the firm to break down complexities introduced by Big Data (unstructured data), vendor tool environments that don’t interact well with each other, siloed business units, and the hoarding of legacy data for fear of losing future insights. It also helps to quickly separate data into business-relevant and technology-relevant data, which makes the datasets easier to manage.

Features of the EIIG catalog

| Capability | Description |
| --- | --- |
| Search and Discovery | Possibility to search across all assets and information in a faceted way (browsing through system, schema, tables, etc. without needing to know the table name) |
| Data Profiling and Data Quality | Data sampling and automatic profiling: data preview, data classification, quality, statistics (e.g. information on technical metadata, fields, values, sample of data, data quality classification) |
| Create articles within the catalog (Wiki-style) | Possibility to arrange knowledge tied to assets beyond 200 words (e.g. tags, information, how the data model is created, how it can be used) |
| Custom data fields | Ability to add fields to the metadata model of the data catalog, e.g. to describe retention time, purpose ID, etc. |
| Glossary Management | Support for synonyms, acronyms, relationships, import/export (company/country level) |
| Asset-to-term matching | Matching assets such as columns to business terms |
| Data Quality Integration | Ability to integrate DQ functionality and information from third-party solutions (e.g. Great Expectations or AWS Deequ) into the catalog. How data quality is presented in the tool (usability) is still to be considered. |
| Portability | Ability to export all data catalog contents, either through the UI or via dedicated APIs |
| Security and Access Control | Security-controlled access to datasets and a granular authorization scheme for segregation of duties |
| Roles and Workflow | Ability to define roles and data governance workflows in the catalog |
| Ratings | Users can rate, endorse, add warnings to, and deprecate assets (not included in the first-stage rollout, but a minimum requirement for next-stage crowdsourcing) |
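To make the Data Profiling and Data Quality capability listed above more concrete, here is a minimal sketch of the kind of per-column statistics automatic profiling typically produces. It is an illustration under stated assumptions, not EIIG's implementation: the function names, the output keys, and the sample values are hypothetical.

```python
from collections import Counter


def infer_type(values):
    """Naive type guess based on the Python types present in the sample."""
    kinds = {type(v).__name__ for v in values}
    if not kinds:
        return "unknown"
    return kinds.pop() if len(kinds) == 1 else "mixed"


def profile_column(values):
    """Summarize one column: row count, null ratio, distinct count, top values."""
    non_null = [v for v in values if v is not None]
    total = len(values)
    return {
        "rows": total,
        "null_ratio": (total - len(non_null)) / total if total else 0.0,
        "distinct": len(set(non_null)),
        "inferred_type": infer_type(non_null),
        "top_values": Counter(non_null).most_common(3),
    }


# Made-up sample of a country-code column, with one missing value.
sample = ["DE", "FR", None, "DE", "US", "DE"]
print(profile_column(sample))
```

A high null ratio or an unexpected `mixed` type in such a profile is the sort of signal a catalog can surface as a data quality classification.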

The EIIG Data Catalog is built within days of an engagement with Orion and is kept up to date in near real time. The findings from the data catalog allow changes to be made at the data sources to manage data better, and/or allow data to be transformed in transit through the data supply chain, thereby optimizing operations to the needs of the firm, be it migrating to a cloud, building a data lake, or rolling out an ERP/CRM.

In all of these cases, Orion EIIG is the tool to get for your Data Catalog.