The data hub layer helps manage internal and external sources of data with data lake structures specified by industry and domain. Data engineers can easily and intuitively interact with the data to enable data discovery and data preparation.
Raw Layer: Store and manage raw data in multiple formats, such as HDFS, Parquet, Hive, Columnar or another type of object storage.
Data Ingestion: Manage multiple real-time and batch workloads as well as data acquisition between your internal and external data providers.
Data Curation: Utilize microservices to curate data from multiple formats into one and transform data for specific use cases.
Data Conform: Conform and structure your data using IBM industry-based data models into useable formats needed for digital, operational and analytical processing.
Data Consumption: Create capabilities to acquire, transport and validate data between the data hub and any layer of Digital Insights to use for analytic, cognitive or digital processes.