Large Language Model-Based Solutions

How to Deliver Value with Cost-Effective Generative AI Applications

Written by: Shreyas Subramanian
Narrated by: Daniel Henning

About this listen

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Shreyas Subramanian, Principal Data Scientist at Amazon Web Services, delivers a practical guide for developers and data scientists who want to build and deploy cost-effective large language model (LLM)-based solutions. The book covers a wide range of key topics, including model selection, data pre- and post-processing, prompt engineering, and instruction fine-tuning.

The author explains techniques for optimizing inference, such as model quantization and pruning, and describes a range of affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:

● Effective strategies to address the challenge of the high computational cost associated with LLMs

● Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques

● Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models

©2024 John Wiley & Sons, Inc. (P)2024 Ascent Audio
Computer Science