Data Scientist

Data Scientist

Develop a state-of-the-art product. Make sense of the future. Use data to drive business.

At CB Insights we build products that help clients make sense of the future and drive their businesses forward using data. Our system retrieves large amounts of structured and unstructured data and uses scientific methods to extract knowledge and insights from that data. We present those analytics through a sophisticated, dynamic user interface which enables our clients to find answers to their most important questions.

The Role You’ll Play:

As a Data Scientist at CB Insights, you’ll be part of a mission to build an accurate, robust, and scalable data intelligence machine that powers clients’ decision-making. You’ll explore and work with large datasets from diverse sources. You’ll use machine learning and other quantitative techniques to build client-facing data products and internal ML infrastructure. 

This role calls for a versatile data scientist: 50% ML engineer, 30% data analysis, 15% prompt engineer, 5% data engineer. We expect end-to-end ownership from research, and POC to productionization.

About the team

You will join an R&D team with a proven track record of shipping high-impact data products and building scalable knowledge extraction pipelines. We’re curious, pragmatic makers who care deeply about solving the right customer problems with the right solution. We move fast with new technologies when they’re the right fit. We want you to understand the drivers of our business and explain your approach effectively to peers and stakeholders.

Our team values:

  • stay curious try new things
  • long-term orientation
  • be bold & move fast
  • make data-informed decisions
  • openness, honesty, healthy conflicts
  • robust execution

Your Main Tasks:

  • Research and build ML/AI pipelines to extract knowledge and insights from high-volume high-velocity contextual data
  • Research and implement quantitative frameworks to measure, evaluate, and predict performance and key attributes for companies, markets, products, and technologies
  • Research and build ML/AI solutions to discover relationships and structures among products, companies, and markets
  • Scout new data sources and derive new data products/features from them
  • Initiate and engage in brainstorms, code reviews, and deep dives with peers to maximize creativity, rigor, and quality of solutions
  • Work closely with Product and Design to find and validate the best-fit solution for a customer problem; present to stakeholders and support key decision-making

What you bring to the table:

  • Advanced degree in computer science, statistics, economics or other quantitative fields
  • Solid foundation in statistics & ML
  • 3+ years of professional experience working on ML and analytics projects with modern data infrastructure. LLM finetuning and large-scale deployment are a plus
  • Excellent analytical and problem-solving skills
  • Write production-ready Python and SQL
  • Communicate concisely and clearly, in both verbal and written forms

You’ll be successful here if you are/have:

  • Own problems end-to-end, and are willing to roll up your sleeves and pick up whatever knowledge you’re missing to get the job done
  • Love learning and experimentation
  • Thrive in a fast-paced environment with high uncertainty
  • Collaborate effectively and advocate for the best values and practices
Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.