Implementing a Scalable Data Engineering Platform for a Trust and Corporate Management Company

The client needed a robust data engineering platform to collate and transform data from multiple ERPs, ensuring scalability, cost-effectiveness, and the ability to incorporate micro-strategies over the data architecture.

Challenge

The primary challenge was to create a scalable and cost-effective data engineering platform capable of integrating multiple ERPs while minimizing service delays and maintaining high performance.

Scalability and Cost Efficiency

Designing a thin and scalable data engineering layer suitable for the cloud with pay-per-use to minimize costs.

Integration and Transformation

Handling data integration and transformation from multiple ERPs to ensure seamless data flow and accurate analytics.

Solution

Entrans developed a comprehensive data engineering platform with Azure architecture, integrating various technologies to achieve the client's objectives.

Result: The platform provided efficient data integration, transformation, and storage, ensuring scalability and cost-effectiveness while supporting future growth.

Detailed Solution:

  • Azure Architecture: The solution utilized Azure architecture to provide a scalable and cost-effective platform. The use of Azure DataBricks with Spark allowed for efficient data processing, despite slower start times and non-serverless nature. The architecture separated computation from storage, with Blob storage used for data storage.
  • Curated Data Lake: The preliminary stage involved creating a curated data lake before loading data into Azure Datawarehouse. Transformations were handled in Spark, and the transformed data was then pushed into Blob storage, which synced with Azure Datawarehouse.
  • Scalable Instances: Startup times were required while running as spot instances, and instances were scaled on demand to ensure optimal performance.

Impact: The implementation led to a scalable, efficient, and cost-effective data engineering platform that streamlined data integration and transformation, enhancing the client's ability to make data-driven decisions.

A screenshot of a dashboard showing the number and type of search results.

API Subscription

Streamlining API Acquisition and Management with Subscriptions Image Content

The Subscriptions Image Content feature makes acquiring APIs more straightforward by offering users a seamless, visual interface to browse and subscribe to various APIs. This functionality eliminates the complexity of traditional API subscription processes, making it easier for users to integrate APIs into their systems quickly. In addition to simplified access, this feature provides users with powerful tools to monitor and track their API usage. Through detailed metrics and insights, users can view API call volumes, performance statistics, and cost data, enabling them to manage consumption effectively. This transparency empowers users to optimise their API usage, control expenses, and ensure resources are utilized efficiently over time. By offering both ease of access and robust tracking capabilities, the Subscriptions Image Content feature helps users make informed decisions and gain deeper insights into how APIs contribute to their operations, ultimately improving overall resource management and project outcomes.

Outcomes

01

The platform provided a thin and scalable data engineering layer suitable for the cloud with pay-per-use, minimizing costs.

02

The platform ensured seamless data flow and accurate analytics by handling data integration and transformation from multiple ERPs.

03

The architecture allowed for the addition of any micro-strategy over the data architecture, supporting future growth and scalability.

Technology Stack and Architecture

01

02

03

04

Methodology

Key step of the project are the following

1
2

Do you have further inquiries or require tailored assistance?