Senior AI/ML Engineer
Location: Pittsburgh, PA or San Francisco Bay Area
About Rockfish:
Rockfish Data is the industry’s first outcome-centric data generation platform helping organizations overcome data bottlenecks—like sparsity and privacy constraints— using high-quality synthetic data.
Position Overview:
We're looking for a Senior AI/ML Engineer who is ready to push the boundaries of generative modeling, privacy-preserving AI, and synthetic data fidelity. You will play a pivotal role in developing and operationalizing state-of-the-art Generative AI models that power enterprise use cases across industries—from observability to cybersecurity and IoT. You will also be involved in leading the design and implementation of data/model pipelines, LLM/NLP services, and interactive workflow tooling to democratize synthetic data generation across multiple use cases and industry verticals. This is a chance to define an emerging field and influence how the world builds with data when real-world access is limited or impossible.
Responsibilities:
Technical Development
Build production-scale generative models handling large-scale operational datasets across diverse domains.
Pioneer novel approaches to use machine learning to solve enterprise data challenges.
Research and development solutions to solve algorithmic and systems design challenges in data quality, fidelity, and statistical validity.
Pipeline orchestration: Design, implement, and operationalize workflows for data ingestion, model training, evaluation, and synthetic data generation.
Cross-Functional Impact
Work closely and hand-in-hand with Rockfish's platform engineering team and the Product team to accelerate the adoption of the Rockfish solution in the enterprise and public sector markets, and translate cutting-edge research into customer value.
Contribute to technical roadmap and architecture for next-generation data platforms.
Innovation & Research
Stay at the forefront of deep generative models and identify breakthrough opportunities
Evaluate and integrate emerging technologies (e.g., state-space models, diffusion models), and other novel architectures
Establish rigorous evaluation frameworks for synthetic data quality and utility
Contribute to open-source communities and represent Rockfish at technical conferences
Communication & Partnerships
Explain complex technical concepts to executives, customers, and investors.
Influence enterprise customers' AI strategies and synthetic data adoption.
Forge technical partnerships with cloud ,data warehouse, and ML platform providers.
Qualifications:
Required
4+ years building production ML systems and modern deep learning frameworks.
2+ years integrating and serving LLMs/NLP models (Hugging Face, OpenAI, custom transformer stacks).
Hands-on experience with PyTorch or TensorFlow in distributed training environments.
Proven track record of deploying complex ML projects from R&D to production.
Experience with cloud platforms (AWS, GCP, Azure) and MLOps best practices
Strong foundation in statistics, probability, and generative modeling techniques
Preferred
Ph.D. in computer science, electrical engineering, mathematics, statistics, or a related field.
Experience with generative models (e.g., GANs, VAEs, diffusion models, transformers),
Background in privacy-preserving ML techniques.
Track record of technical leadership in startup or high-growth environments.
Publications or contributions to ML research communities.
Why Rockfish
Green-field ownership — inform the design of a category-defining to our flagship product.
Direct customer impact — your work unblocks customer deals where bespoke data rules are a must.
Small team, big surface area — touch everything from model latency to pixel-perfect UI.
Competitive comp & early equity in a VC-backed, high-growth startup.
Ready to build the future of generative data?
If you are passionate about bringing to market world-class AI and ML solutions, Apply via careers@rockfish.ai or reach out through our site: https://www.rockfish.ai/contact-us