Data Engineer, Web Scraping - Warsaw, Poland
Warsaw, Poland
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
Point72
We invest in Discretionary Long/Short, Macro, and Systematic strategies. We’re inventing the future of finance by revolutionizing how we develop our people and how we use data to shape our thinking. Join our team to innovate, experiment, and be...A Career with Point72’s Market Intelligence Team
Point72’s Market Intelligence team is responsible for developing proprietary research products, providing data and research management services for investment teams, to support their pursuit of superior, risk-adjusted returns. We leverage innovative alternative data sources, advanced data analytics and technologies, as well as deep fundamental research to create high-quality compliant and differentiated research. Backed by the full resources of Point72, our sector-aligned teams collaborate to solve important research problems in partnership with the Firm’s investment and compliance professionals.
What you’ll do
As a Data Engineer focused on Web Scraping, you will be building solutions for processing big and unstructured data sets at Point72. You will be responsible for extracting and ingesting data from websites, using compliance-approved web crawling tools. In this role you will:
- Own the creation process of these tools, services, and workflows to improve crawl/scrape analysis, reports and data management
- Be responsible for testing the data and the scrape to ensure accuracy, quality, and compliance
- Own the process to identify and rectify any issues with breaks as well as scale scrapes as needed
What’s required
- Solid Python and SQL knowledge
- Experience in running automated programs at scale
- Working experience with cloud-based technologies
- Familiarity with Linux/UNIX, HTTP, HTML, JavaScript and Networking,
- Familiarity with common techniques and tools for crawling, extracting and processing data (e.g., Requests BeautifulSoup, Scrapy, Pandas, Selenium, Spark, etc.)
- Working knowledge of version control systems and open source practices
- Great analytical and problem-solving skills
- Great communication skills (written and spoken in English)
- Bachelor's Degree in Computer Science or a related field or the equivalent demonstrated experience
- Experience with extracting text from PDFs, images, applications, etc. is a plus
- Experience with system monitoring / administration tools is a plus
- Experience with working with applications designed to display archived web content is a plus
- Commitment to the highest ethical standards
- Prior experience with analyzing big data sets
We take care of our people
We invest in our people, their careers, health, and well-being. We want you to concentrate on the success and leave the rest to us. When you work here, we provide:
- Sports card
- Private life insurance
- Private medical and dental care, with vision allowance
- Private pension scheme
- Volunteer opportunities
- Support for employee-led affinity groups representing women, people of color and the LGBT+ community
- Business travel accident insurance
- Employee assistance program
- Educational assistance reimbursement
About Point72
Point72 Asset Management is a global firm led by Steven Cohen that invests in multiple asset classes and strategies worldwide. Resting on more than a quarter-century of investing experience, we seek to be the industry’s premier asset manager through delivering superior risk-adjusted returns, adhering to the highest ethical standards, and offering the greatest opportunities to the industry’s brightest talent. Our Warsaw office gives us access to world-class talent with a reputation for excellence and innovation. We’re looking to build an office of subject-matter experts whose fresh perspectives will help evolve our infrastructure and advance the capabilities of our teams. Learn more at Point72.com/Warsaw.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Big Data Computer Science Data Analytics Data management JavaScript Linux Open Source Pandas Python Research Selenium Spark SQL Testing Unstructured data
Perks/benefits: Career development Health care Medical leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.