PINGDOM_CHECK

Web Scraping Copilot is live. Build Scrapy spiders 3× faster, free in VS Code.

Install Now
  • Data Services
  • Pricing
  • Login
    Sign up👋 Contact Sales

Zyte Developers

Coding tools & hacks straight to your inbox

Become part of the community and receive a bi-weekly dosage of all things code.

Join us
    • Zyte Data
    • News & Articles
    • Search
    • Social Media
    • Product
    • Data for AI
    • Job Posting
    • Real Estate
    • Zyte API - Ban Handling
    • Zyte API - Headless Browser
    • Zyte API - AI Extraction
    • Web Scraping Copilot
    • Zyte API Enterprise
    • Scrapy Cloud
    • Solution Overview
    • Blog
    • Webinars
    • Case Studies
    • White Papers
    • Documentation
    • Web Scraping Maturity Self-Assesment
    • Web Data compliance
    • Meet Zyte
    • Jobs
    • Terms and Policies
    • Trust Center
    • Support
    • Contact us
    • Pricing
    • Do not sell
    • Cookie settings
    • Sign up
    • Talk to us
    • Cost estimator
ON-DEMAND WEBINAR

How to serve LLM efficiently with open source models and libraries

Get insights on efficiently serving large language models, focusing on a throughput-optimized regime.

Join us for an insightful webinar with Konstantin Lopukhin, Head of Data Science at Zyte. This session is specifically designed for AI enthusiasts, data scientists, and machine learning engineers who are looking to optimize their handling of large language models.


In this session, you will learn about:


  • Techniques such as quantization, continuous batching, and speculative decoding to enhance efficiency.

  • The pros and cons of various implementations, including exllamav2, vllm, and TensorRT-LLM.

  • Guidance on selecting the best approach based on model size, available hardware, and target performance metrics.


Whether you are looking to improve your current model serving strategies or planning to implement new ones, this webinar will equip you with the insights and practical advice needed to achieve a throughput-optimized regime.


For any follow-up questions after watching the webinar, join our Discord community and engage directly with the team. We are a thriving community of 3000+ web scraping enthusiasts, committed to sharing insights, learning and exploring new technologies, and advancing in web scraping.

G2.com

Capterra.com

Proxyway.com

EWDCI logoMost loved workplace certificateZyte rewardISO 27001 iconG2 rewardG2 rewardG2 reward

© Zyte Group Limited 2026