There are no second-class data sources. With 70+ connectors to the most common ones, and an SDK for those less-common, Datameer lets you load structured, semi-structured, and/or unstructured data from any source with ease.
Upload a single file, import an entire dataset, or link to a data source so new data is pulled in as frequently as you need it. Datameer loads all data in raw format directly into Hadoop. The data integration process is optimized and supported with robust sampling, parsing, scheduling and data retention tools that make it simple and efficient for any user to get the data they need quickly.
Datameer provides fully automated partitioning, compaction, compression and other services that can easily tame even your largest data sets. Incremental data loading can be configured with time- or data-driven triggers, or can connect with existing job scheduling and monitoring tools to ensure you’ve captured every record. Datameer’s fault-tolerance features include email alerts and detailed reports, and they filter out corrupt, dirty, and incomplete data, based on configurable thresholds.