Data Collection Services
The Big Data and Artificial Intelligence revolutions require large amounts of data from multiple sources.
The data sources can be internal, such as ERPs, CRMs and corporate files, or external. External data sources include all publicly available data. The trend in Open Data theoretically enables everyone to access multiple types of information. Numerous governments, cities and public organizations increasingly publish data openly.
However, collecting this data is not always simple, because the data sets are highly heterogeneous and so are the methods used to collect them, such as file downloads and APIs. Furthermore, the data is not always properly indexed.
Basedig provides a platform dedicated to automated data collection from multiple sources.
Collecting data and metadata together
Basedig collects the raw data in whichever form it is available (CSV or Excel files, APIs, HTML content) and imports it into a unified content container called a “Basebook”. A Basebook contains both the data itself and the metadata.
The metadata includes information such as the name of each data set, the source of the data, any applicable licenses, revision dates and the type of each data column.
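To make the idea of a container holding data and metadata together more concrete, here is a minimal Python sketch. It is not Basedig's implementation: the `Basebook` class, its field names, the `infer_type` heuristic and the sample values are all assumptions made for illustration. It reads a small CSV sample and stores the rows alongside the kind of metadata listed above, including a guessed type for each column.

```python
import csv
import io
from dataclasses import dataclass, field


@dataclass
class Basebook:
    """Hypothetical container pairing tabular data with its metadata."""
    name: str
    source: str
    license: str
    revision_date: str
    column_types: dict = field(default_factory=dict)  # column name -> type label
    rows: list = field(default_factory=list)          # one dict per data row


def infer_type(values):
    """Guess a column type from its string values: integer, float, or text."""
    for caster, label in ((int, "integer"), (float, "float")):
        try:
            for value in values:
                caster(value)
            return label
        except ValueError:
            continue
    return "text"


def basebook_from_csv(text, name, source, license, revision_date):
    """Parse CSV text and wrap the rows and inferred column types in a Basebook."""
    rows = list(csv.DictReader(io.StringIO(text)))
    columns = list(rows[0].keys()) if rows else []
    types = {c: infer_type([r[c] for r in rows]) for c in columns}
    return Basebook(name, source, license, revision_date, types, rows)


# Illustrative sample data and metadata (all values are made up).
sample = "city,population,area_km2\nParis,2161000,105.4\nLyon,516092,47.87\n"
book = basebook_from_csv(
    sample, "French cities", "example.org", "Open Licence", "2020-01-01"
)
print(book.column_types)
# {'city': 'text', 'population': 'integer', 'area_km2': 'float'}
```

Keeping the metadata in the same object as the rows means that the source, license and column types travel with the data wherever it is copied or exported.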
A database search engine
When the data is published on the Basedig platform, it is indexed by Basedig’s search engine. When a user searches for a specific type of information, Basedig quickly identifies the data sets that are likely to contain it.
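How Basedig's search engine works internally is not described here. As a general illustration of how indexing makes such lookups fast, the following Python sketch builds a simple inverted index, a common technique that maps each word to the data sets whose description contains it. The data set identifiers, descriptions and function names are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(datasets):
    """Map each word to the set of data set ids whose description contains it."""
    index = defaultdict(set)
    for dataset_id, description in datasets.items():
        for word in tokenize(description):
            index[word].add(dataset_id)
    return index


def search(index, query):
    """Return the sorted ids of data sets containing every word of the query."""
    words = tokenize(query)
    if not words:
        return []
    matches = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(matches)


# Hypothetical data set descriptions, for illustration only.
datasets = {
    "funds_fr": "Investment funds in France by category",
    "cities_fr": "Population of cities in France",
    "stocks": "Daily stock levels by warehouse",
}
index = build_index(datasets)
print(search(index, "france population"))  # ['cities_fr']
```

Because the index is built once at publication time, a query only has to intersect a few small sets instead of scanning every data set.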
An API and a data download utility
Data available on Basedig can be downloaded as a file (CSV, Excel or JSON) or via an API.
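The sketch below shows, using only the Python standard library, how the same rows can be serialized to two of the formats mentioned above, CSV and JSON. It does not call or describe the Basedig API; the function names and the sample rows are assumptions for illustration.

```python
import csv
import io
import json


def to_csv(rows):
    """Serialize a list of dicts to CSV text, using the first row's keys as header."""
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize a list of dicts to indented JSON text, keeping non-ASCII characters."""
    return json.dumps(rows, ensure_ascii=False, indent=2)


# Made-up sample rows.
rows = [
    {"city": "Paris", "population": 2161000},
    {"city": "Lyon", "population": 516092},
]
print(to_csv(rows))
print(to_json(rows))
```

CSV is convenient for spreadsheets, while JSON preserves the distinction between numbers and text, which matters when the data is consumed by another program.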
Custom data collection
Basedig also offers data collection services for specific needs. Do not hesitate to contact us if you have a project.