IMMIGRATION Sr. Data Scientist II - Job ID 556
Houston, TX
IMO Health
From clinical terminology to streamlined workflows to data standardization, we enable insights that help improve patient care across the healthcare ecosystem.
Specific Duties Include:
- Analyze and process textual data for bioinformatics applications using the Melax CLAMP software kit and clinical NLP techniques; design, customize, and extend the existing Melax software suite and web-service applications according to Melax product needs and customer requirements; develop, maintain, and improve NLP applications that process unstructured biomedical texts into structured and searchable information; modify and improve current Melax products by developing and incorporating cutting-edge machine learning and deep learning algorithms and techniques for enhanced performance and usability; communicate with customers, analyze their NLP needs and requirements, deliver products and projects, and provide assistance; work within the NLP development team to develop NLP modules in different programming or scripting languages such as Java, JavaScript, J2EE, and HTML; conduct pre-processing and quality analyses for textual data inputs and performance validation for NLP output; create systematic testing, error-checking procedures, and user manuals; conduct customer consultation and technical support on NLP training, installation, development, and deployment; share knowledge with team members and across the organization on topics including new and emerging NLP methods and technologies; build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies; and build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics. Must take and pass Python/Java coding test to solve one NLP algorithm problem. Option to work remotely 60% of the time.
Position Requires:
- Bachelor’s degree, or foreign equivalent, in Computer Information Systems, Informatics, or a closely related field of study, plus 5 years of experience in the job offered, or as an NLP Developer, NLP Data Engineer, NLP Data Scientist, Research Assistant, or a closely related NLP position. Must have 5 years of experience in the following: developing NLP applications and building machine learning models; developing ETL pipelines and processes in big data environments; deploying, maintaining, versioning, and A/B testing machine learning models; working in at least one of these databases: AWS Redshift, Oracle, SQL Server, or MySQL; using SQL to write complex queries across large volumes of data; developing and deploying full-stack solutions in Python; using and following standardized development practices and tools, including TFS/Git, code standards, and process standards; writing unit tests using standard unit test frameworks; working with statistical techniques, concepts, methods, and approaches and their application; and using multivariate calculus and linear algebra. Must have 3 years of experience with the following: working with AWS tools including Lambda and SageMaker; creating and using process documentation and workflows; working with TensorFlow and TensorFlow Serving; working with statistical modeling using R or MATLAB; using big data frameworks such as Spark/PySpark; and working with Tableau, Looker, QlikView, R Shiny, or similar data visualization tools. Must also have 2 years of experience using infrastructure-as-code tools, like Terraform. Must take and pass Python/Java coding test to solve one NLP algorithm problem. Option to work remotely 60% of the time.
Tags: A/B testing AWS Big Data Bioinformatics Clinical NLP Data visualization Deep Learning ETL Git Java JavaScript Lambda Linear algebra Looker Machine Learning ML models MySQL NLP Oracle Pipelines PySpark Python QlikView R Redshift Research SageMaker Spark SQL Statistical modeling Statistics Tableau TensorFlow Terraform Testing