Data Engineer
About The Position
Company Overview:
Cellebrite’s (Nasdaq: CLBT) mission is to enable its global customers to protect and save lives by enhancing digital investigations and intelligence gathering to accelerate justice in communities around the world. Cellebrite’s AI-powered Digital Investigation Platform enables customers to lawfully access, collect, analyze and share digital evidence in legally sanctioned investigations while preserving data privacy. Thousands of public safety organizations, intelligence agencies and businesses rely on Cellebrite’s digital forensic and investigative solutions (available via cloud, on-premises and hybrid deployments) to close cases faster and safeguard communities. To learn more, visit us at www.cellebrite.com and https://investors.cellebrite.com/investors, and find us on social media @Cellebrite.
Position Overview:
We are assembling an elite, small-scale team of innovators committed to a transformative mission: advancing generative AI from conceptual breakthrough to tangible product reality. As a Senior Data Engineer, you will be the critical data backbone of our innovation engine, transforming raw data into the fuel that powers groundbreaking GenAI solutions, driving Cellebrite's digital intelligence capabilities to unprecedented heights.
Your Strategic Role
You are not just a data engineer – you are a strategic enabler of GenAI innovation. Your primary mission is to:
- Prepare, structure, and optimize data for cutting-edge GenAI project exploration
- Design data infrastructures that support rapid GenAI prototype development
- Uncover unique data insights that can spark transformative AI project ideas
- Create flexible, robust data pipelines that accelerate GenAI research and development
What Sets This Role Apart
- Data as the Foundation of AI Innovation
  - You'll be working at the intersection of advanced data engineering and generative AI
  - Your data solutions will directly enable the team's ability to experiment with and develop novel AI concepts
  - Every data pipeline you design has the potential to unlock a breakthrough GenAI project
- Exploration and Innovation
  - Conduct deep data exploration to identify potential GenAI application areas
  - Work closely with AI researchers to understand data requirements for cutting-edge GenAI projects
Data Engineering Expertise
- Advanced skills in designing data architectures that support GenAI research
- Ability to work with diverse, complex datasets across multiple domains
- Expertise in preparing and transforming data for AI model training
- Proficiency in creating scalable, flexible data infrastructure
Technical Capabilities
- Deep understanding of data requirements for machine learning and generative AI
- Expertise in cloud-based data platforms
- Advanced skills in data integration, transformation, and pipeline development
- Ability to develop automated data processing solutions optimized for AI research
Research and Innovation Skills
- Proven ability to derive strategic insights from complex datasets
- Creative approach to data preparation and feature engineering
- Capacity to identify unique data opportunities for GenAI projects
- Strong experimental mindset with rigorous analytical capabilities
Requirements
- Degree in Computer Science, Data Science, or related field
- 5+ years of progressive data engineering experience
- Demonstrated expertise in:
  - Cloud platforms (AWS, Google Cloud, Azure)
  - Big Data technologies
  - Advanced SQL and NoSQL database systems
  - Data pipeline development for AI/ML applications
  - Performance optimization techniques
Technical Skill Requirements
- Expert-level SQL and database management
- Proficiency in Python, with strong data processing capabilities
- Experience in data warehousing and ETL processes
- Advanced knowledge of data modeling techniques
- Understanding of machine learning data preparation techniques
- Experience integrating with BigQuery (an advantage)