Save Your Seat

The MLOps Live Webinar Series

Session #17

Tuesday 14th of December 2021 - 9am PST / 12 noon EST / 6pm CET

Session #17: Scaling NLP Pipelines at IHS Markit

The data science team at IHS Markit will be sharing practical advice on building sophisticated NLP pipelines that work at scale. Using a robust and automated MLOps process, they run complex models that make massive amounts of unstructured data searchable and indexable.

In this session, they will share their journey with MLOps and provide practical advice for other data science teams looking to:

  • Ingest, prepare, classify and index structured and unstructured data (in this case, PDFs and images)
  • Handle terabytes of data in hours, not months
  • Make deployment of models seamless by working in one unified research and production environment
  • Leverage CI/CD for ML
  • Allow for sharing and reuse of components across projects and teams
  • Utilize auto-scaling serverless functions to abstract away infrastructure complexities
  • Build rapidly, iterate faster and focus on the business logic rather than the underlying infrastructure


Nick and Yaron will share their approach to automating the NLP pipeline end to end. They’ll also touch on how to use Iguazio and the MLRun open source framework, which comes with capabilities such as Spot integration and Serving Graphs, to reduce costs and to accelerate and simplify the data science process.

If you cannot attend the live session, you can still register and receive a link to the recording after the event.

Presented By
Yaron Haviv

Co-Founder and CTO, Iguazio

Nick Brown

Senior Data Scientist, IHS Markit