Analyze your data usage with Opendatasoft’s new data lineage feature

Learn more
Glossary

Data Gathering

Data gathering is the systematic, methodological process of collecting raw data from inside and outside an organization.

What is data gathering?

Data gathering is the systematic, methodological process of collecting raw data, from multiple sources across an organization and beyond. It is the first step in any data management process, and using this data enables organizations to understand performance, make better informed decisions, test hypotheses and share data internally and externally.

Data gathering covers multiple types of information, both quantitative and qualitative, including:

  • Data created by systems in the course of their operations (such as IoT sensors, machinery, connected cars)
  • Data captured through human behavior, such as while a customer is navigating a website
  • Data entered into systems by customers/consumers, such as filling in online forms
  • Data entered into systems by staff, such as while on the phone to customers in a contact center
  • Data generated by business systems, such as sales, CRM or accounting solutions
  • Data captured by researchers directly from survey interviews or focus groups
  • Data gathered from third parties, such as partners or data brokers

What is the data gathering process?

We live in a world where data is being created at an exponentially increasing rate. Therefore, it is important to take a systematic approach to data gathering, rather than simply collecting data without a plan or methodology. Following this process is recommended:

  • Identify what you are aiming to achieve. This will affect which data you collect and where you collect it from
  • Select the right datasets to collect. Understand which data sources you are going to gather to achieve your goals and find out who owns the data
  • Quality check the data before beginning collection. Work with the data owner to ensure that the data meets your needs and has the level of granularity required
  • Create a documented process for data gathering. This should include how often data will be collected, where it will be routed to, how it will then be used, and the security that will be in place to protect it
  • Collect data. Automate the process of collecting data, but ensure that there is rigorous checking for errors to ensure it is in the right format and of high quality
  • Regularly review the entire process, particularly if there are any changes in the data source itself

What are the challenges to effective data gathering?

As it is the first step in the data management process, it is essential that data gathering is carried out correctly and systematically.

Organizations therefore need to overcome these key challenges to ensure their data gathering delivers value:

  • Ensure data quality. Data must be accurate and reliable, otherwise results and decisions taken later in the process will be unreliable. When gathering data ensure that:
    • It is standardized and is in the correct format, with descriptions of each field so that it can shared and combined later in the process
    • All relevant data is captured, without gaps or errors
    • You understand key parameters around the data, such as frequency of its creation and its granularity
    • It is consistent, using the same units (such as for time, speed or weight)
  • Ensure data is fit for purpose. Data may be generated for one purpose (such as running an industrial machine), but then used for other reasons (such as calculating the overall efficiency of a factory). Ensure that the data you gather is a good fit for your needs.
  • Ensure data is secure. Reduce compliance risk by ensuring data gathering is secure, meets regulations such as the GDPR/CCPA and protects personally identifiable information (PII)

 

Download the ebook making data widely accessible and usable

Learn more

How to create the best data experiences: key features that customers love when democratizing data Product
How to create the best data experiences: key features that customers love when democratizing data

Data democratization requires a strong data experience platform that is flexible enough to meet a range of user needs. In this blog we bring together a selection of our customers’ favorite features that help them save time and deliver compelling data experiences.

“Building a data democratization platform means adapting to every organization’s tech stack” Product
“Building a data democratization platform means adapting to every organization’s tech stack”

Our customers now enjoy improved connection features for retrieving data from a variety of sources, including SharePoint and Google Drive, and for quickly creating datasets. We sat down with Coralie Lohéac, the project’s coordinator, to find out more.

From SharePoint to Google Drive, our new connections increase the value of your data Product
From SharePoint to Google Drive, our new connections increase the value of your data

We are further expanding the range of connections between the Opendatasoft platform and third-party applications, ensuring integration with any technology stack. This improvement comes alongside another time-saving update: a complete redesign of the dataset creation process.

How to create the best data experiences: key features that customers love when democratizing data Product
How to create the best data experiences: key features that customers love when democratizing data

Data democratization requires a strong data experience platform that is flexible enough to meet a range of user needs. In this blog we bring together a selection of our customers’ favorite features that help them save time and deliver compelling data experiences.

“Building a data democratization platform means adapting to every organization’s tech stack” Product
“Building a data democratization platform means adapting to every organization’s tech stack”

Our customers now enjoy improved connection features for retrieving data from a variety of sources, including SharePoint and Google Drive, and for quickly creating datasets. We sat down with Coralie Lohéac, the project’s coordinator, to find out more.

From SharePoint to Google Drive, our new connections increase the value of your data Product
From SharePoint to Google Drive, our new connections increase the value of your data

We are further expanding the range of connections between the Opendatasoft platform and third-party applications, ensuring integration with any technology stack. This improvement comes alongside another time-saving update: a complete redesign of the dataset creation process.

Start creating the best data experiences

Request a demo