Responsibilities:
- Design, create, maintain, and optimize data pipeline architecture on a cloud platform
- Perform data preparation (ETL) and data cleansing on large, complex data sets from multiple sources
- Work with business units, data analysts, business intelligence teams, and data scientists to understand their data needs and support the data infrastructure
- Work with developers to define data tagging requirements
- Ensure quality of data and keep track of data lineage
- Implement data governance policies to comply with regulations
- Identify, design, and implement internal process improvements, such as automating manual processes, optimizing data delivery, and re-designing infrastructure for greater scalability
- Help the CDO/Team Lead coach data engineers to ensure a culture of high performance
Preferred qualifications:
- Proven experience building big data architecture in production
- Experience with cloud platforms
Skills & Experience:
- 3+ years of experience in a data engineer or data architect role
- Bachelor's degree in Computer Science or a related field
- Knowledge of big data architecture concepts and design
- Experience with relational (SQL) and NoSQL databases
- Experience with data pipeline and workflow management tools (e.g., Luigi, Airflow) and stream processing
- Experience supporting and working with cross-functional teams in a dynamic environment is a plus
- Communication skills, including the ability to understand business processes in any area in detail
- Growth mindset and willingness to learn new things and share with others