Data Engineer
Hyderabad
Appen
See how Appen provides data to improve AI, guide our customers to driving innovation, accelerating AI development, and staying ahead of the competition.
About Appen
Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation platform to collect and label various types of data like images, text, speech, audio, and video.
Our data is crucial for building and continuously improving the world's most innovative artificial intelligence systems and Appen is already trusted by the world's largest technology companies. Now with the explosion of interest in generative AI, Appen is helping leaders in automotive, financial services, retail, healthcare, and governments the confidence to deploy world-class AI products.
At Appen, we are purpose driven. Our fundamental role in AI is to ensure all models are helpful, honest, and harmless, so we firmly believe in unlocking the power of AI to build a better world. We have a learn-it-all culture that values perspective, growth, and innovation. We are customer-obsessed, action-oriented, and celebrate winning together.
At Appen, we are committed to creating an inclusive and diverse workplace. We are an equal opportunity employer that does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Appen is seeking a highly skilled and motivated Data Engineer to join our dynamic team. In this role, you will use your extensive knowledge of software development to build and enhance complex systems and applications, contributing to the evolution of AI and machine learning.
Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation platform to collect and label various types of data like images, text, speech, audio, and video.
Our data is crucial for building and continuously improving the world's most innovative artificial intelligence systems and Appen is already trusted by the world's largest technology companies. Now with the explosion of interest in generative AI, Appen is helping leaders in automotive, financial services, retail, healthcare, and governments the confidence to deploy world-class AI products.
At Appen, we are purpose driven. Our fundamental role in AI is to ensure all models are helpful, honest, and harmless, so we firmly believe in unlocking the power of AI to build a better world. We have a learn-it-all culture that values perspective, growth, and innovation. We are customer-obsessed, action-oriented, and celebrate winning together.
At Appen, we are committed to creating an inclusive and diverse workplace. We are an equal opportunity employer that does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Appen is seeking a highly skilled and motivated Data Engineer to join our dynamic team. In this role, you will use your extensive knowledge of software development to build and enhance complex systems and applications, contributing to the evolution of AI and machine learning.
Key Responsibilities:
- Create and maintain optimal data pipeline architecture.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability.
- Build analytics tools that utilizes the data pipeline to provide actionable insights into customer delivered data, operational efficiency and other key business performance metrics.
- Optimize and improve our existing data products.
Qualifications :
- Bachelor's degree in computer science, Software Engineering, or a related field. A master's degree is a plus.
- 2-5 years of experience in data engineering background.
- Strong programming skills in Python/Java, excellent SQL skills.
- Experience with relational SQL and NoSQL databases.
- Expertise in AWS services EMR, RDS, Glue, Athena, S3, Data Pipeline, Redshift, Lambda, API Gateway.
- Experience with big data tools: Spark, Kafka.
- Experience with data streaming systems like spark-streaming, storm.
- Experience with data pipeline and workflow management tools: Airflow
- Experience with shell scripting.
- Ability to quickly understand and appreciate underlying business context, problems and objectives of analytical projects.
- Excellent time management skills
- Clear communication skills to run well defined analysis and produce reports.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Category:
Engineering Jobs
Tags: Airflow APIs Architecture Athena AWS Big Data Computer Science Engineering Finance Generative AI Java Kafka Lambda Machine Learning NoSQL Python Redshift Shell scripting Spark SQL Streaming
Perks/benefits: Career development
Region:
Asia/Pacific
Country:
India
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Sr. Data Engineer jobsData Scientist II jobsStaff Data Scientist jobsBI Developer jobsStaff Machine Learning Engineer jobsPrincipal Data Engineer jobsData Manager jobsSenior AI Engineer jobsJunior Data Analyst jobsData Science Intern jobsData Science Manager jobsResearch Scientist jobsBusiness Data Analyst jobsPrincipal Software Engineer jobsData Specialist jobsLead Data Analyst jobsSoftware Engineer II jobsData Analyst Intern jobsSr. Data Scientist jobsData Engineer III jobsBI Analyst jobsJunior Data Engineer jobsDevOps Engineer jobsSoftware Engineer, Machine Learning jobsAI/ML Engineer jobs
Snowflake jobsEconomics jobsLinux jobsOpen Source jobsData Warehousing jobsComputer Vision jobsMLOps jobsGoogle Cloud jobsAirflow jobsNoSQL jobsRDBMS jobsKafka jobsBanking jobsHadoop jobsJavaScript jobsClassification jobsScala jobsScikit-learn jobsPhysics jobsKPIs jobsData warehouse jobsOracle jobsTerraform jobsStreaming jobsGitHub jobs
PostgreSQL jobsScrum jobsPySpark jobsR&D jobsLooker jobsPandas jobsSAS jobsCX jobsBigQuery jobsData Mining jobsDistributed Systems jobsJira jobsdbt jobsRobotics jobsIndustrial jobsRedshift jobsUnstructured data jobsReact jobsMicroservices jobsJenkins jobsData strategy jobsNumPy jobsE-commerce jobsPharma jobsGPT jobs