Data landscape consolidation banner

Data landscape consolidation

Overview

Challenge

  • Products collect, process and store survey and non-survey data from the same type of source systems (org partners): survey engines, sampling engines, panel management systems, custom reference and master systems, external data providers. Such redundant data extractions and storage systems on organization level lead to costs overheads and inefficiencies.
  • Products leverage technologies for data collection, processing and storage they prefer which leads to lack of reuse, increased engineering efforts and maintenance costs on organization level.
  • There is lack of data reusability and data trust across products as well as lack of data discoverability for data exploration and analytical needs across products.
  • Inconsistent data management & security leading to operational & security risks.

All these challenges increase the demand for relevant software capabilities and new solutions to maintain IT/OT software applications within system operation and to cope with the increasing demand for change.

Approach

To address customer challenges the proposal was to build data platform leveraging Hub-and-Spoke architecture pattern and Azure PaaS technologies with a goal to:

feature chip icon

Unified data source consolidation

Act as single source of company data (survey data, reference data, master data, sales, media and other data assets) segregated by data domains for all the new data products.

arrow down icon

Streamlined data collection and utilization

Reduce costs and complexity involved in data collection thus reducing total time to market for new data analytical initiatives and increasing organizational capabilities reuse.

feature calculator icon

Data landscape cost optimization

Reduce overall cost of data landscape by:

  • Reducing redundant data extraction by multiple applications.
  • Reducing redundant storage for same raw data across multiple products.
  • Reducing complexity for data engineering across the organization.
feature parts icon

Engineering efficiency and acceleration

Reduce engineering efforts and accelerate delivery of new analytical data products in a governed way through unified technology stack, reusable platform assets like data pipeline templates, IaaC modules, CICD pipeline templates, etc.

feature person icon

Automated data governance enablement

Enable data management through automated data governance processes and centralized Data Catalog containing metadata of company data estate

feature shield-check icon

Security framework establishment

Establish security framework through automated security policies, auditing, monitoring, defined access models and security checklist / processes to follow

feature laptop icon

Data exploration & analysis facilitation

Provide data exploration capabilities for platform users for analysis and hypothesis verification needs

Delivery of the data platform was based on Agile practices and focused only on capabilities required for identified high-priority business and platform use-cases.

Achievements

feature arrow down icon

Efficient data management and cost reduction

Significant reduction in redundant data extraction and data storage, reducing costs, increasing efficiency & data reusability

feature person icon

Enhanced data engineering efficiency

Streamlined data engineering across company reducing 25% to deliver new analytical data products

feature slices icon

Creation of unified platform-level data products

Established multiple platform-level data products with consolidated company data assets for various data domains used by analytical products, power users and advanced analytics

feature rocket icon

Rapid analytical product innovation

Accelerated delivery of 3 brand new analytical products leveraging reusable data platform assets and unified technology

feature puzzle icon

As-a-service platform for analytical products

Introduction of as-a-service platform capabilities for analytical data product needs with support of API-based and event-based integrations

feature square icon

Infrastructure and DataOps enhancement

Enhanced monitoring, capacity management & chargebacks, infrastructure security & standardization through Infrastructure-as-a-Code and DataOps implementation

feature shield-check icon

Data exploration environment satisfaction

Satisfaction of data analysts and other power users through data exploratory environments capability

feature chip icon

Advanced data management and security

Improved data management across company through automated metadata capturing, data discoverability via Data Catalog, self-service data access, governed business glossaries and enhanced data security based on company policies

Tech stack

Azure services

  • Dala lake gen2 icon
    Data lake storage gen2
  • key vault icon
    Key vault
  • functions icon
    Functions
  • Purview icon
    Purview
  • Data factory icon
    Data factory
  • api management icon
    Api management
  • Data lake icon
    Data lake
  • SQL Managed instance icon
    SQL Managed instance
  • sql icon
    SQL
  • Event grid icon
    Event grid
  • DevOps services icon
    DevOps services

Big data

  • databricks icon
    Databricks
  • deltalake icon
    Deltalake
  • PySpark icon
    PySpark

Code quality tools

  • sonarcube icon
    SonarCube

IT service management

  • servicenow icon
    ServiceNow

Frameworks

  • flask icon
    Flask

Infrastructure automation

  • terraform icon
    Terraform

Programming languages

  • Python icon
    Python

Cloud

  • Azure icon
    Azure

Case studies

We are well-versed in the dynamic world of development across a variety of industries.

Contact us

Anfimau Industry Solutions GmbH

Managing director: Mikhail Anfimau

contact us

Mergenthalerallee 15-21 65760 Eschborn, Germany

Phone

+49 6196 7008475

Tax number

040 228 55754

VAT ID

DE345344498

Trade registry

HRB 123580