Senior ML Engineer

Invoca - 3 jobs

Toronto, ON

Posted 15 days ago

Job details:

Remote
$152,000 - $228,000 / year
Full-time
Executive

Benefits:

Health insurance
Paid time off
Wellness programs
Stock options

About Invoca

Invoca is an AI-powered revenue execution platform that brings together marketing, commerce, and contact center teams to turn every customer interaction into measurable, profitable growth. Join our dynamic, fast-growing team, where innovation and collaboration are at the core of our culture.

About the Team

The Data Platform team owns the full ML lifecycle at Invoca, from model training and fine-tuning through inference optimization and production APIs. We move quickly, swarm on hard problems, and care deeply about code quality, reliability, and each other's growth. Learn more on our blog or check out our open source projects.

About the Role

We're hiring a Senior ML Engineer to own the productionization layer of Invoca's ML stack — model serving, inference optimization, fine-tuning, and the APIs and pipelines that tie it all together. You'll be a primary driver of the infrastructure powering our Context Engine and agentic AI workflows, working closely with Data Scientists, Data Engineers, and Applied AI Engineers.

Core Focus & Primary Ownership

  • Lead End-to-End MLOps and Productionization: Architect, implement, and maintain CI/CD pipelines for ML artifacts — including model evaluation, versioning, and automated deployment. Serve as the primary SME for operational excellence across the Invoca ML stack.
  • Design and Optimize SLM/LLM Deployment: Own the full inference infrastructure, including model serving on Triton Inference Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs for internal and external model access.

Broader Contributions

  • Fine-Tune Language Models: Apply parameter-efficient fine-tuning (PEFT) methods such as LoRA and QLoRA to adapt transformer-based SLMs and LLMs for high-impact NLP applications in conversation intelligence.
  • Evolve ML Infrastructure: Contribute to model training infrastructure, data pipelines, and data lake foundations to keep the systems powering our models reliable and scalable.
  • Collaborate Across Teams: Partner closely with Data Scientists, Data Engineers, and Applied AI Engineers to build the foundational ML systems behind Invoca's agentic AI products.
  • Deliver Customer Value: Work with product and engineering to understand customer needs and ship ML solutions that make a measurable difference.

What You Bring

  • 5+ years of ML Engineering experience with a strong production focus
  • Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
  • Demonstrated track record deploying and maintaining transformer-based NLP models in production
  • Hands-on experience fine-tuning SLMs/LLMs with PEFT methods such as LoRA and QLoRA, and optimizing models via quantization, batching, and throughput tuning
  • Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
  • Experience building production-grade APIs that expose ML models to downstream consumers
  • Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
  • B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus
  • Familiarity with RLHF or preference training is a bonus