[WEBINAR] Product Talk: Using AI to enhance the data marketplace search experience

Save your place
Data self-service

Transform your data catalog into an internal data portal to create greater value

Data catalogs are fundamental tools to inventory data, but are not sufficient to truly democratize it. Discover why the creation of a data portal is essential to unlock true value and build a data-driven organization.

Brand content manager, Opendatasoft
More articles

As the volume and variety of data produced by organizations continues to grow, data catalogs are increasingly vital tools, providing organizations with the ability to document and organize their data assets. However, data catalogs alone are not enough to democratize usage and create value.

The creation of a data portal is the key to exploiting data and facilitating its reuse beyond experts and specialists within organizations. In this blog find out more about the limitations of data catalog solutions, and what you can do to generate greater value from your data at scale.

Copy to clipboard

Faced with ever-increasing quantities of data, data analysts and data scientists need tools to sort and document their data assets. This is what a data catalog does, facilitating data collection, quality control, enrichment and analysis.

What is a data catalog?

A data catalog is an inventory of all an organization’s data. In addition to bringing data together in a single location, the catalog contains metadata that describes these assets and enables users to determine whether or not it is relevant to their needs.

Why use a data catalog?

Implementing a data catalog delivers a number of benefits to organizations including:

  • Documentation: all data is cataloged in one place, providing centralized information on every data asset.
  • Search: thanks to filtering, sorting and search functions, users can easily locate information relevant to their needs.
  • Time savings: as data is easier to find, technical teams save time when managing, using and exploiting data.
Copy to clipboard

While the implementation of data catalogs is becoming a requirement for data-driven organizations, the technology itself has several limitations.

A complex solution solely for experts

The majority of data catalog tools are relatively complex, codified solutions. To use these tools, specific technical skills are required, both for implementation and for data retrieval. This limits their use to trained experts.

However, in order to unlock real value from data across the organization, everyone needs easy access to data, not just experts with advanced technical skills. Data has to be available to all – for example to staff in HR, marketing, finance and operations, without requiring them to master complex new tools.

Limited use

In addition to their lack of simplicity, data catalogs have a limited use. Essentially they provide a searchable metadata catalog. This makes it easier to identify data,
but this doesn’t go far enough. They don’t help to create new services or optimize decision-making using this data. Instead, organizations need to make data easily accessible, comprehensible and reusable, and, importantly, available to everyone on a self-service basis.

Copy to clipboard

While a data catalog is vital for the development of a data strategy, it is insufficient on its own to deliver value at scale from your data.

To democratize data in your ecosystem, it must be:

  • quality-enhanced, enriched and formatted. Data must be cleansed before being made accessible to all on a data portal. This creates trust with users and increases data sharing across your organization.
  • accessible by all audiences that you choose via a single access point. Through powerful, intuitive search and filter functions, users with no technical expertise should be able to find ready-to-use data in just a few clicks.
  • presented in formats that meet user needs through data visualizations, dashboards, graphs, or raw formats. Data must also be easily reusable with the organization’s business tools.
  • documented with metadata (DCAT, DCAT-AP, INSPIRE) so that it can be understood by all and reused with confidence.
  • available in formats that encourage reuse through the ability to export in common formats including Excel, CSV, APIs and as GIS data, such as GPX for smartphone mapping.

Ebook - Data Portal: the essential solution to maximize impact for data leaders

Copy to clipboard

Métropole de Rouen Normandie: from geographic data catalog to open data portal

The Rouen Normandie metropolitan area decided to adopt a data catalog to manage its geographic data in 2017. A data directory already existed, but it was not updated, making access to the data relatively complex.

To take data sharing a step further, it decided to implement an open data approach to deliver two main objectives:

  • Break down silos between departments and automate processes through connectors, web services and data harvesting. The aim was to make publishing data as easy as possible for data producers, and to strengthen data governance processes.
  • Promote an open data approach in the region, thanks to a single access point to all of the area’s data. This supports greater public transparency and strengthens citizen engagement and participation in government. It also encourages data reuse and innovation: by improving knowledge of the area’s data assets, it’s easier to reuse the available information and therefore to innovate.
Flore Bonhomme
“Opendatasoft's capabilities make it possible to enhance and showcase data. This is true of its data visualization tools, which enable us to create thematic maps or dashboards. Data can also be reused by various applications, whether third-party or developed in-house, such as our mobile app, which relies on open data."
Flore Bonhomme
Intelligent Territory and Open Data Project Manager, Métropole de Rouen Normandie

Schneider Electric: creating new uses for corporate data

Schneider Electric’s technology stack includes multiple tools dedicated to data management, including data catalogs. But these did not allow the organization to truly democratize data and add value to its usage.

Schneider Electric therefore turned to Opendatasoft to set up an internal portal accessible throughout the company: the Data Library. Thanks to this internal data portal, everyone has full access to data. It means that data scientists can concentrate on analyzing data rather than preparing and processing it, while employees now have access to a series of customizable dashboards, and can share datasets internally.

However, this is not the only benefit of moving beyond a data catalog. Schneider Electric has also been able to create Exchange, a data services marketplace which brings together a wide range of data, resources and tools for sharing and monetizing its data with partners. This would have been impossible with a simple data catalog.

"This project has enabled us to demonstrate that we are capable of capturing data, processing it, enriching it and adding value to it, across the fields of IoT, energy management and industrial use. The platform has given us the ability to understand how we can generate new value from our data, and that's very powerful for our business."
Herveig Lecuyer
Data Factory Program Lead, Schneider Electric

Data catalogs are indispensable tools for inventorying data and supporting data governance policies. However, they are not enough to make data easily accessible and consumable internally by as many people as possible. They don’t help accelerate the wider use of data or release its value. Creating a data portal enables organizations to take this next step, and unlock the full potential of data, helping to democratize its use and build truly data-centric organizations.

Articles on the same topic : Self-service data Data catalog Data portal Metadata

Learn more
How to break down organizational silos to engage everyone in your data project Data access
How to break down organizational silos to engage everyone in your data project

Organizational silos prevent data sharing and collaboration, increasing risk and reducing efficiency and innovation. How can companies remove them and ensure that data flows seamlessly around the organization so that it can be used by every employee?

What is the difference between a data product and a data asset? Data Trends
What is the difference between a data product and a data asset?

Data products and data assets both aim to make data usable and valuable. What are the differences between the two and how do you incorporate them into your data strategy?

The central role of data in delivering the Paris 2024 Olympic and Paralympic Games Company news
The central role of data in delivering the Paris 2024 Olympic and Paralympic Games

As we get closer to the start of the world's biggest sporting event, we look at the role of data in preparing for the Paris 2024 Olympic and Paralympic Games, which start on July 26th 2024.