Job Purpose :
Data Engineer plays a crucial role in developing and optimizing our data architecture to support data-driven initiatives and enhance overall business intelligence.
Key Responsibilities :
- Design, implement, and maintain scalable and robust data architecture to support business needs.
- Collaborate with stakeholders to understand data requirements and ensure alignment with organizational goals.
- Develop and implement efficient Extract, Transform, Load (ETL) processes to integrate data from various sources into a unified format.
- Ensure data integrity and quality throughout the ETL pipeline.
- Develop and Implement event streaming framework and services.
- Design and manage relational and non-relational databases, ensuring optimal performance and scalability.
- Implement database optimization strategies and conduct performance tuning.
- Utilize big data technologies such as Hadoop, Spark, and Kafka to process and analyze large volumes of data.
- Implement solutions for real-time data streaming and processing.
- Develop and maintain data models for efficient storage and retrieval of information.
- Collaborate with data scientists and analysts to understand data requirements for analytical purposes.
- Create and maintain metadata repositories, documenting data lineage, definitions, and relationships.
- Implement and enforce metadata management best practices.
- Implement data quality checks and validation processes to ensure accuracy and reliability of data.
- Collaborate with cross-functional teams to address and resolve data-related issues.
Qualifications :
Bachelor s degree in computer science, Information Technology, or a related field.8-9 years of proven experience as a Data Engineer or in a similar role.Strong proficiency in SQL and database management systems (e.g., MySQL, PostgreSQL, MongoDB).Experience with big data technologies (Hadoop, Spark) and data processing frameworks.Knowledge of ETL tools and techniques.Knowledge of DevOps practice.Knowledge of event streaming frameworks (Kafka, Event Hub, Apache Storm, Stream Analytics etc)