Build a Large Language Model (From Scratch)

Author : Sebastian Raschka
Publisher : Manning
Total Pages : 0
Release : 2024-08-27
ISBN-10 : 1633437167
ISBN-13 : 9781633437166
Rating : 4/5 (166 Downloads)

Book Synopsis: Build a Large Language Model (From Scratch) by Sebastian Raschka

Build a Large Language Model (From Scratch) was written by Sebastian Raschka and published by Manning. This book was released on 2024-08-27. Available in PDF, EPUB and Kindle.

Book excerpt:

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

In Build a Large Language Model (From Scratch), you’ll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. You’ll go from the initial design and creation to pretraining on a general corpus, all the way to finetuning for specific tasks.

Build a Large Language Model (From Scratch) teaches you how to:

- Plan and code all the parts of an LLM
- Prepare a dataset suitable for LLM training
- Finetune LLMs for text classification and with your own data
- Use human feedback to ensure your LLM follows instructions
- Load pretrained weights into an LLM

The large language models (LLMs) that power cutting-edge AI tools like ChatGPT, Bard, and Copilot seem like a miracle, but they’re not magic. This book demystifies LLMs by helping you build your own from scratch. You’ll get a unique and valuable insight into how LLMs work, learn how to evaluate their quality, and pick up concrete techniques to finetune and improve them. The process you use to train and develop your own small-but-functional model in this book follows the same steps used to deliver huge-scale foundation models like GPT-4. Your small-scale LLM can be developed on an ordinary laptop, and you’ll be able to use it as your own personal assistant.

Purchase of the print book includes a free eBook in PDF and ePub formats from Manning Publications.

About the book

Build a Large Language Model (From Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into constructing LLMs, including building a data loading pipeline, assembling their internal building blocks, and finetuning techniques. As you go, you’ll gradually turn your base model into a text classifier tool and a chatbot that follows your conversational instructions.

About the reader

For readers who know Python. Experience developing machine learning models is useful but not essential.

About the author

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.


Build a Large Language Model (From Scratch) Related Books

Build a Large Language Model (From Scratch)
Language: en
Pages: 0
Authors: Sebastian Raschka
Categories: Computers
Type: BOOK - Published: 2024-08-27 - Publisher: Manning

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up! In Build a Large Language Model (From Scratch), you’ll...
LLM from Scratch
Language: en
Pages: 0
Authors: Anand Vemula
Categories: Computers
Type: BOOK - Published: 2024-06-07 - Publisher: Independently Published

"LLM from Scratch" is an extensive guide designed to take readers from the basics to advanced concepts of large language models (LLMs). It provides a thorough u...
Building LLM Powered Applications
Language: en
Pages: 343
Authors: Valentina Alto
Categories: Computers
Type: BOOK - Published: 2024-05-22 - Publisher: Packt Publishing Ltd

Get hands-on with GPT 3.5, GPT 4, LangChain, Llama 2, Falcon LLM and more, to build LLM-powered sophisticated AI applications. Key Features: Embed LLMs into real-...
Mastering Large Language Models
Language: en
Pages: 465
Authors: Sanket Subhash Khandare
Categories: Computers
Type: BOOK - Published: 2024-03-12 - Publisher: BPB Publications

Do not just talk AI, build it: Your guide to LLM application development. KEY FEATURES ● Explore NLP basics and LLM fundamentals, including essentials, challen...
Hands-On Large Language Models
Language: en
Pages: 428
Authors: Jay Alammar
Categories: Computers
Type: BOOK - Published: 2024-09-11 - Publisher: O'Reilly Media, Inc.

AI has acquired startling new language capabilities in just the past few years. Driven by the rapid advances in deep learning, language AI systems are able to w...