PINGDOM_CHECK

#ExtractSummit2026 The world's largest web scraping conference returns. Austin Oct 7–8 · Dublin Nov 10–11.

Login Try Zyte API Contact Sales

Unblocking and Extraction
Zyte API
The ultimate API for web scraping. Avoid website bans and access a headless browser or AI Parsing
Ban Handling
Headless Browser
AI Extraction
SERP
Enterprise
Documentation Support
Hosting and Deployment
Scrapy Cloud
Run, monitor, and control your Scrapy spiders however you want to.
Coding Agent Add-Ons
Agentic Web Data
Plugins that give coding agents the context to build production Scrapy projects. Starts with Claude Code.
Data Services
Pricing
Browse
Subscribe
- NewsletterSwiftly delivered
- Discord communityExtract Data community
Product and E-commerce
From e-commerce and online marketplaces
Data for AI
Collect and structure web data to feed AI
Job Posting
From job boards and recruitment websites
Real Estate
From Listings portals and specialist websites
News and Article
From online publishers and news websites
Search
Search engine results page data (SERP)
Social Media
From social media platforms online
Meet Zyte
Our story, people and values
Contact us
Get in touch
Support
Knowledge base and raise support tickets
Terms and Policies
Accept our terms and policies
Open Source
Our open source projects and contributions
Web Data Compliance
Guidelines and resources for compliant web data collection
Join the team building the future of web data
We're Hiring
Trust Center
Security, compliance & certifications

Login Try Zyte API Contact Sales

Blog WebinarsHow to serve LLM efficiently with open source models and libraries

WebinarHow To Data science Future of the web Large Language Models (LLMs)The new oil

How to serve LLM efficiently with open source models and libraries

A

Arnold Alexander

·

1 min read · July 16, 2024

Get insights on efficiently serving large language models, focusing on a throughput-optimized regime.

Join us for an insightful webinar with Konstantin Lopukhin, Head of Data Science at Zyte. This session is specifically designed for AI enthusiasts, data scientists, and machine learning engineers who are looking to optimize their handling of large language models.

In this session, you will learn about:

Techniques such as quantization, continuous batching, and speculative decoding to enhance efficiency.
The pros and cons of various implementations, including exllamav2, vllm, and TensorRT-LLM.
Guidance on selecting the best approach based on model size, available hardware, and target performance metrics.

Whether you are looking to improve your current model serving strategies or planning to implement new ones, this webinar will equip you with the insights and practical advice needed to achieve a throughput-optimized regime.

For any follow-up questions after watching the webinar, join our Discord community and engage directly with the team. We are a thriving community of 3000+ web scraping enthusiasts, committed to sharing insights, learning and exploring new technologies, and advancing in web scraping.

More webinars

Keep watching

All webinars →

2026 Web Scraping Industry Report by Zyte

2026 Web Scraping Industry Report by Zyte

A practical walkthrough of the Web Scraping Industry Report 2026, covering how AI, automation, and access controls are reshaping web data collection at scale.

2 min read

Master modern unblocking tactics against the latest anti-bot defenses

Master modern unblocking tactics against the latest anti-bot defenses

Learn how to prepare for modern anti-bot systems with advanced unblocking tactics.

2 min read

Scrape, Analyze & Visualize Web Data with Streamlit

Scrape, Analyze & Visualize Web Data with Streamlit

Join Hyder Khan | Data Engineer, @ Flipdish as he shares how to extract, clean, analyze, and visualize web data using a seamless workflow with Streamlit.

1 min read

Services

Zyte Data

Coding tools & hacks straight to your inbox. Bi-weekly dosage of all things code.

Web Scraping API

Zyte API

Coding tools & hacks straight to your inbox. Bi-weekly dosage of all things code.

Developers

Zyte Developers

Coding tools & hacks straight to your inbox. Bi-weekly dosage of all things code.

Product & E-commerce
Data for AI
Job Posting
Real Estate
News & Articles
Search
Social Media

Blog
Learn
Case Studies
Webinars
White Papers
Join our community
Documentation

Meet Zyte
Contact us
Jobs
Support
Terms and Policies
Trust Center
Do not sell
Cookie settings

Web Data Compliance
Open Source
What is Web Scraping
Web Scraping in Python: Ultimate Guide
Stop getting blocked, start scraping

Most loved workplace certificate

Zyte reward

G2 reward

G2 reward

G2 reward

X Facebook Instagram YouTube LinkedIn Discord

© Zyte Group Limited 2026