Software Engineer, Data Infrastructure
Tasks
- Build ML data infrastructure
- Build evaluation pipelines
- Build training data pipelines
- Co-design data systems with training and serving
- Design scalable, high-throughput data pipelines
- Ensure dataset quality standards
- Implement batching and GPU-aware loading
- Implement dataset versioning
- Ingest, preprocess, and augment data
- Integrate novel datasets with external vendors
- Load datasets for training
Skills/Tech-stack
Audio Processing | Data Augmentation | Data Pipelines | Data Preprocessing | Data Loading | Dataset Versioning | GPU | Machine Learning | Multimodal Data | Python | SQL | Streaming Data | Video Processing