具身多模态数据分析算法开发实习生
Tasks
- Analyze multi turn dialogue emotion transitions
- Build multi modal data quality metrics
- Clean deduplicate and filter text audio and dialogue corpora
- Clean parse and preprocess raw multimodal data
- Compute similarity and select high value samples
- Detect anomalies and remove duplicates
- Develop automated pre labeling pipeline
- Evaluate data quality and value
- Extract knowledge graphs and generate training corpora
- Implement automated data quality inspection
- Label dialect and conversational speech data
- Perform intent slot entity or sentiment annotation
- Perform sensor time synchronization
- Support online labeling and hard example mining
- Transcribe and proofread ASR outputs
Perks/Benefits
Skills/Tech-stack
ASR | Anomaly Detection | Audio Data | Audio Data Processing | Automated Data Labeling | Cloud processing | Corpus Engineering | Data Deduplication | Data Mining | Data Preprocessing | Data Processing | Data Quality | Data labeling | Data synchronization | Decoding | Dialogue Analytics | Ego4D | Entity recognition | FFmpeg | Intent detection | Knowledge graph | Machine Learning | Multimodal Data | Multimodal Processing | NLP | Named Entity Recognition | OpenCV | Point Cloud | Point Cloud Processing | PyTorch | Python | RAG | ROS | Retrieval-Augmented Generation | Sensor data | Sensor data synchronization | Sentiment Analysis | Similarity Analysis | Slot Filling | Speech Transcription | Text Cleaning | Time Synchronization | VLA | VLM | Video Processing
Education
Related jobs
-
自动驾驶数据闭环工程师-Data Infra CNY 25K-37KAI model | C++ | Data Mining | Data Quality | Data Quality EvaluationMid-level Full Time北京、苏州8h ago
-
Analyst, Data Science CNY 216K-264KArtificial Intelligence | Code review | Document Ingestion Pipelines | Document ingestion | Fine TuningMid-level Full TimeCN - AIA Financial Center Building, …1d ago
-
Senior-level Full TimeChina-Shanghai (Tianshan-W-Rd)1d ago
-
Lead Data Scientist CNY 360K-600KLanguage Models | Language Processing | Large Language Models | Machine Learning | Natural LanguageAdoption leave | Annual Medical Checkup | Birthday leave | Compassionate leave | Examination leaveSenior-level Full TimeChina-Shanghai (Tianshan-W-Rd)1d ago
-
Lead Data Scientist CNY 360K-600KData Analysis | Data Architecture | Language Models | Language Processing | Large Language ModelsAnnual Medical Checkup | Family care leave | Flexible benefits platform | Flexible working hours | Life insuranceSenior-level Full TimeChina-Shanghai (Tianshan-W-Rd)1d ago
-
Senior-level Full TimeChina-Shanghai (Tianshan-W-Rd)1d ago
-
Mid-level Full TimeHangzhou3d ago
-
Mid-level Full Time深圳3d ago
-
【27届实习】云原生Ai平台研发工程师-杭州 CNY 37K-37KArgo Workflow | Computer networks | Containers | Data Structures | GoEntry-level Internship杭州3d ago
-
【27届实习】数据挖掘工程师 CNY 25K-37KData Structures | Deep learning | Distributed machine learning | Go | JavaFull-time conversion opportunityEntry-level Internship Temporary上海3d ago
-
Entry-level Internship南京3d ago
-
Mid-level Full Time东莞3d ago
-
Ai算法工程师 CNY 180K-300KConvolutional Neural Networks | Data Mining | Data Warehouse | Data labeling | Deep learningMid-level Full Time东莞3d ago
-
Agile | Automated testing | C++ | CI/CD | Distributed SystemsCareer comeback program | Flexible working optionsSenior-level Full TimeShanghai, Mainland China3d ago
-
Mid-level Full TimeWuxi, Jiangsu, China4d ago
-
Senior-level Full TimeChina4d ago
-
Senior-level Full TimeChina4d ago
-
Senior Machine Learning Engineer CNY 360K-600KAWS | Agent architecture | Blender | ComfyUI | Context engineeringSenior-level Full TimeChina4d ago
-
Bash | Data Ingestion | Data Processing | Docker | GCPAsynchronous work culture | Friendly laid-back atmosphereMid-level Full TimeShanghai, China4d ago
-
Computer Graphics | Computer Vision | CoreML | Deep learning | Diffusion ModelsSenior-level Full TimeBeijing, Beijing, China5d ago
-
CUDA | DeepSpeed | Distributed Training | FSDP | Gradient CheckpointingEntry-level Full TimeBeijing, Beijing, China5d ago
-
Senior-level Full TimeBeijing, China5d ago
-
AI Computing Software Development Engineer, TensorRT CNY 144K-240KArtificial Intelligence | C# | C++ | Debugging | Deep learningSenior-level Full TimeChina, Shanghai5d ago
-
Medical - Real-World Evidence Data Scientist - Beijing CNY 240K-360KBiostatistics | Clinical Study Design | Clinical study | Data Management | Data cleaningMid-level Full TimeBeijing, China5d ago
-
Entry-level Internship深圳5d ago