Staff Data Scientist, Metrics Quality

What you’ll do:

  • Analyze: Investigate the root causes of discrepancies in our critical externally reported and internally used metrics, and estimate the impact of data inaccuracies on our business decisions through deep-dives and strategic analyses. You will work to establish a consistent ground truth for our data, thereby enhancing its integrity and trustworthiness.
  • Investigation Support: Serve as a key point of contact and final sign-off for resolution of data anomalies and inconsistencies. Collaborate with client engineers and data warehouse teams to drive investigations, identify potential root causes and define success metrics for these inquiries.
  • Opportunity Sizing and Analysis: Write clear, actionable analyses that help teams identify areas for improvement in our metrics quality. For example, quantify how much user activity is underestimated due to networking issues, and suggest how addressing these issues can enhance overall metric accuracy.
  • Data Controls: Develop and implement robust data quality checkers and alerts, ensuring they provide high signal without excess noise.
  • Streamline Communication: Partner with TPMs to ensure that quality issues and best practices are communicated effectively within the company. This includes enhancing key processes such as SOX compliance and ensuring ML teams have access to the best data available. 
  • Leadership: In this role, you’ll have the freedom to target opportunities you deem most critical and to set the strategic direction of the team. Lead by example to identify priority areas and elevate your colleagues' data capabilities by sharing insights and best practices.

 

What we’re looking for:

  • 8+ years of combined post-graduate academic and industry experience solving real-world problems with web-scale data.
  • Proven ability to identify and solve data integrity issues in real-world environments. 
  • Expertise in at least one scripting language, ideally Python or R.  
  • Proficiency in SQL/Hive, with a strong ability to manipulate and extract value from large datasets. Experience with Airflow and SparkSQL is also valuable.
  • Rigorous analytical skills, with a meticulous approach to validating results and ensuring data reliability.  
  • Excellent communication skills, with proven experience leading initiatives across multiple product areas and effectively conveying findings to leadership and product teams.  
  • Demonstrated leadership in managing key technical projects, significantly influencing the scope and enhancing the output of team efforts.

 

Relocation Statement:

  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

 

In-Office Requirement Statement:

  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.

 
