搜索与数据工程实习生(Ai大模型方向)
Tasks
- Analyze anti scraping mechanisms
- Build RAG data pipelines
- Build distributed crawler systems
- Build proxy pool and task scheduling
- Build search indexing and retrieval
- Build training datasets
- Clean and parse data
- Collect data from public internet sources
- Control access frequency
- Improve scalability and runtime efficiency
- Monitor crawl exceptions
- Optimize RAG retrieval results
- Optimize crawl pipeline stability
- Optimize crawl strategies and request patterns
- Optimize search retrieval performance
- Parse dynamic web pages
- Scrape web and document data sources
- Structure and process data for knowledge base
- Support data annotation and effect analysis
- Support search quality evaluation
- Update search data
Perks/Benefits
- N/A
Skills/Tech-stack
AJAX | Anti-scraping | CSS selector | Cookie | Distributed crawling | Dynamic Web | Dynamic Web Page Parsing | Elasticsearch | Embedding | Faiss | Feature analysis | HTML | HTTP | HTTPS | Kafka | Knowledge Base | Linux | Milvus | MongoDB | MySQL | OpenSearch | Playwright | Proxy Pool | Python | RAG | REST API | Rate Limiting | Redis | Regular Expression | Request Feature Analysis | Scrapy | Selenium | Session | Task Scheduling | Vector Search | Web Scraping | XPath
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Related jobs
-
Entry-level Internship深圳1d ago
-
Intern CNY 28K-50KAutomated testing | Continuous integration | Deep learning | Java | Knowledge graphsEntry-level InternshipBeijing,Beijing,China2d ago
-
Entry-level Internship Part TimeWuxi, Jiangsu, China2d ago
-
Data Tech Development Manager CNY 240K-360KApache Flink | Apache Spark | Auto Scaling | Batch Processing | Cloud servicesContinuous professional development | Flexible working | Inclusive workplace | Opportunities for growthMid-level Full TimeShanghai, Shanghai, China2d ago
-
GenAI & Software Engineering Intern CNY 60K-60KActiveMQ | Agentic AI | Amazon Web Services | Azure | Azure ServiceCanteen snacks | Community volunteer opportunities | Employee clubs and classes | On-site meals | Toastmasters International chapterEntry-level InternshipWuxi, CN3d ago
-
GenAI & Software Engineering Intern CNY 60K-60KAI Agents | AWS | ActiveMQ | Azure | Azure ServiceCanteen | Clubs and sports groups | Community volunteering | On-site meals | SnacksEntry-level InternshipWuxi, CN3d ago
-
Ai算法实习生(振动与力学方向) CNY 25K-37KAPI Integration | Convolutional Neural Network | Keras | Langchain | Neural NetworkEntry-level Internship深圳3d ago
-
Algorithm Engineer Intern CNY 38K-50KAlgorithms | C++ | Computer Vision | Data Structures | Deep learningHands-on experience | Healthcare innovation experience | Industry mentorship | MentorshipEntry-level Internship Part TimeShenyang - PIC, China4d ago
-
Entry-level Internship北京、上海4d ago
-
Entry-level Internship北京、上海4d ago
-
Entry-level Internship上海4d ago
-
Entry-level Internship上海4d ago
-
具身智能数据开发实习生 CNY 25K-37KAPI Development | Algorithms | Data Structures | Data Transformation | Data VisualizationEntry-level Internship上海4d ago
-
大数据工程开发实习生 CNY 50K-50KArgo Workflows | Backend Development | Data Analysis | Data Ingestion | Data QualityEntry-level Internship上海4d ago
-
具身多模态数据分析算法开发实习生 CNY 25K-37KASR | Anomaly Detection | Automatic Speech Recognition | Cloud processing | Computer VisionInternship experience | MentorshipEntry-level Internship上海4d ago
-
Senior-level Full Time广州4d ago
-
AI Intern - LLM CNY 25K-37KData Annotation | Fine Tuning | Langchain | Language Processing | Machine LearningEntry-level Full Time InternshipBeijing, Beijing, China4d ago
-
Data Science, Intern CNY 38K-50KData Preprocessing | Data Transformation | Data Validation | Data Visualization | Data cleaningEntry-level InternshipZhuhai, Guangdong5d ago
-
AWS | Access Control | Agentic Workflows | Auditability | AzureMid-level Full TimeGuangzhou, Guangdong, China R6d ago
-
AWS | Agent Orchestration | Agent systems | Azure | Cloud platformMid-level Full TimeGuangzhou, Guangdong, China6d ago
-
Computer Vision | Digital Twin | Domain Randomization | Embedded AI | Isaac SimEntry-level InternshipChina, Shanghai9d ago
-
Embodied AI Intern CNY 45K-50KC++ | Computer Vision | Deep learning | Gazebo | Isaac SimHands on industry scale data annotation experience | Onsite work three days per week | Structured mentoringEntry-level Internship Part TimeShanghai, China9d ago
-
CI/CD | Docker | ETL | FastAPI | FlaskEntry-level InternshipShanghai, YANGPU, China9d ago
-
C++ | CUDA | Control Systems | Isaac Lab | Isaac ROSEntry-level Full Time InternshipChina, Shanghai10d ago
-
C++ | CUDA | Isaac Lab | Isaac ROS | Isaac SimEntry-level InternshipChina, Shanghai10d ago