Data Science Engineer

Essential Duties & Functions: 

  • Oversee data functions such as ingestion of structured/unstructured data, transformation, standardization, and QA to build robust and automated data pipelines.

  • Document and maintain robust processes that move large amounts of complex data between multiple data storage systems, including text extracts, RDBMS, and parquet formats.

  • Develop and maintain data pipelines specifically for NLP and Generative AI models, ensuring data is properly preprocessed and transformed for model training and deployment, utilizing Snowflake for scalable data warehousing solutions where applicable.

  • Perform deep-dive analyses using SQL and Python, leveraging data science and big data tools to derive actionable insights and to identify and resolve complex data quality issues.

  • Collaborate with AI/ML teams to support the training, fine-tuning, and deployment of NLP and Generative AI models, utilizing Snowflake for efficient storage and retrieval of the large datasets these models require.

  • Work with SQL, Python, and Apache Spark to perform iterative analysis and integrate the results into ETL data pipelines.

  • Implement and optimize NLP model pipelines using frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers, integrating them with existing data infrastructures, including Snowflake for enhanced data management and accessibility.

  • Monitor, review, and analyze inbound and outbound data.

  • Proactively identify issues and trends through data analysis and manipulation.

  • Analyze and refine text data, including tokenization, embedding generation, and handling large-scale language models, to enhance NLP model performance, leveraging Snowflake’s capabilities for managing and processing large text datasets.

  • Present findings in reports, using visualization tools such as Power BI and Python data visualization libraries.

  • Communicate effectively with cross-functional teams.

  • Support the integration of NLP models into business applications, ensuring that they deliver actionable insights and meet performance benchmarks, with Snowflake serving as a core component for data storage and analytics.

  • Create and maintain code in a GitHub repository for change control.

  • Support off-hours data processing and emergency requests.

Experience, Qualifications, Knowledge and Skills: 

  • Advanced Python skills, including popular data science libraries such as Pandas, NumPy, Matplotlib, or similar.

  • Experience with NLP libraries and frameworks such as spaCy, NLTK, or Hugging Face Transformers.

  • Advanced SQL skills.

  • Experience with Snowflake for data warehousing, including integration with other data processing tools and platforms.

  • Experience with big data using Spark (PySpark, Scala) and knowledge of Spark internals.

  • Experience working with Generative AI models, including training and fine-tuning language models like GPT.

  • Experience with AWS.

  • Bachelor’s degree (Statistics, Math, Computer Science, or related field) and 2 to 5 years of related experience and/or training; or an equivalent combination of education and experience as a data engineer/analyst/scientist.

  • Strong health care data knowledge (medical claims data, clinical data, pharmacy data, and eligibility data) preferred.

  • Experience in deploying NLP models in production environments, particularly in industries such as healthcare or finance, utilizing Snowflake for scalable data solutions.

  • Experience with version control software (Git preferred), agile development experience, and knowledge of design patterns.

  • Experience working independently, contributing as a member of the team, and being results-driven.

  • Ability to communicate complex NLP and AI concepts to non-technical stakeholders in a clear and concise manner.

  • Strong presentation skills to explain and present advanced statistical methods using non-technical language to key business stakeholders.

Physical Demands: 

● Sedentary work - Exerting up to 10 pounds of force occasionally, and/or a negligible amount of force frequently to lift, carry, push, pull or otherwise move objects in daily work use (laptop, monitors, etc.). Sedentary work involves sitting most of the time. Use of keyboards (typing) and exposure to computer screens occurs daily. Pleasant work environment in office locations with occasional noise or dust.

● The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. 

● While performing the duties of this job, the employee is regularly required to stand; walk; sit; use hands; reach with hands and arms; think; and talk or hear (multi-channel, two-way communication during work hours is required).

Location and Workplace Flexibility: We have offices in Atlanta GA, Boston MA, Morristown NJ, Plano TX, St. Louis MO, St. Petersburg FL, and Hyderabad, India. We foster a hybrid and remote-friendly culture, and all of our employees' work locations are based on the needs of the position and determined by the Leadership team. In-office work and activities, if applicable, vary based on the work and team objectives in accordance with Company policies.

Zelis is modernizing the healthcare financial experience by providing a connected platform that bridges the gaps and aligns interests across payers, providers, and healthcare consumers. This platform serves more than 750 payers, including the top 5 national health plans, BCBS insurers, regional health plans, TPAs and self-insured employers, and millions of healthcare providers and consumers. Zelis sees across the system to identify, optimize, and solve problems holistically with technology built by healthcare experts – driving real, measurable results for clients.

Commitment to Diversity, Equity, Inclusion, and Belonging 
At Zelis, we champion diversity, equity, inclusion, and belonging in all aspects of our operations. We embrace the power of diversity and create an environment where people can bring their authentic and best selves to work. We know that a sense of belonging is key not only to your success at Zelis, but also to your ability to bring your best each day.

Equal Employment Opportunity  
Zelis is proud to be an equal opportunity employer. All applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 

We encourage members of traditionally underrepresented communities, including women, LGBTQIA people, people of color, and people with disabilities, to apply, even if you do not believe you 100% fit the qualifications of the position.

Accessibility Support 

We are dedicated to ensuring our application process is accessible to all candidates. If you are a qualified individual with a disability or a disabled veteran and require a reasonable accommodation with any part of the application and/or interview process, please email TalentAcquisition@zelis.com.

SCAM ALERT: There is an active nationwide employment scam that is using the Zelis name to obtain personal information or commit financial fraud. This site is secure, and any applications made here are with our legitimate partner. If you’re contacted by a Zelis Recruiter, please ensure whoever is contacting you truly represents Zelis Healthcare. We will never ask for the exchange of any money or credit card details during the recruitment process. Please be aware of any suspicious email activity from people who could be pretending to be recruiters or senior professionals at Zelis.
