Use case – Building An Azure Data Hub for BI & Data Science

Our client is a retail organisation, seeking to future proof their Data platform to provide greater accessibility to their data for a wider of range of reporting, analysis, and potential Data Science requirements.

They have significant experience with traditional Relational Database Data Warehouses and reporting with On-Premise SQL Server deployments through SSRS and more recently with Power BI against a freshly developed Hybrid Cloud Data Warehouse implementation.

They are eager to future proof their investment data to make their data more accessible for both internal and external analysts and to embrace current technology trends particularly with regards to Data Lakes, Big Data analytics, and Machine Learning and Data Science.


Business Requirement

 
Provide a standardised, centrally administrable gateway to data
 
 
Provide simple, secure and rapid access to a wide variety of data sets
 
 
Provide access to different audiences, both internal and external
 
 
Consolidate structured, semi-structured and unstructured data
 
 
Develop a set of golden rules for managing the Data Lake
 
 
Plan a scalable solution
 
 
Provision for potential future development in the areas of Artificial Intelligence, Machine Learning and Data Science
 


The PTR Solution

The proposed solution was built on the Microsoft Azure Data Platform.

  • Azure Synapse (Data Warehouse)
  • Azure Data Lake
  • Azure Data Factory
  • Azure Databricks


The Benefits

  • A single consolidated and clean repository of quality data from all business applications serving as a single source of truth for the organisation
  • A central data hub that can server both BI and Data Science requirements.
  • Integration of data from disparate sources, types, locations and custodians
  • Direct connectivity to raw unstructured data
  • Data science, Machine Learning and AI ready data sources