Building Large Language Models from Scratch
von APRESS
Building Large Language Models from Scratch
von APRESS
inkl. Ust.
65,99 €
Lieferung
Lieferung am Di. 02.06.2026
Händler*in
BMS
Der*die Händler*in gewährt für dieses Produkt eine Widerrufsfrist von 30 Tagen. Für Details lies bitte die Widerrufsbelehrung und das -formular sowie die jeweiligen Händler-AGB.
Produktdetails
This book is a complete, hands-on guide to designing, training, and deploying your own Large Language Models (LLMs)—from the foundations of tokenization to the advanced stages of fine-tuning and reinforcement learning. Written for developers, data scientists, and AI practitioners, it bridges core principles and state-of-the-art techniques, offering a rare, transparent look at how modern transformers truly work beneath the surface. Starting from the essentials, you’ll learn how to set up your environment with Python and PyTorch, manage datasets, and implement critical fundamentals such as tensors, embeddings, and gradient descent. You’ll then progress through the architectural heart of modern models, covering RMS normalization, rotary positional embeddings (RoPE), scaled dot-product attention, Grouped Query Attention (GQA), Mixture of Experts (MoE), and SwiGLU activations, each explored in depth and built step by step in code. As you advance, the book introduces custom CUDA kernel integration, teaching you how to optimize key components for speed and memory efficiency at the GPU level—an essential skill for scaling real-world LLMs. You’ll also gain mastery over the phases of training that define today’s leading models: Pretraining - Building general linguistic and semantic understanding. Midtraining - Expanding domain-specific capabilities and adaptability. Supervised Fine-Tuning (SFT) - Aligning behavior with curated, task-driven data. Reinforcement Learning from Human Feedback (RLHF) - Refining responses through reward-based optimization for human alignment. The final chapters guide you through dataset preparation, filtering, deduplication, and training optimization, culminating in model evaluation and real-world prompting with a custom TokenGenerator for text generation and inference. By the end of this book, you’ll have the knowledge and confidence to architect, train, and deploy your own transformer-based models, equipped with both the theoretical depth...
Infotabelle
Produktspezifikationen
| Autor | Dilyan Grigorov |
| Format | gebundene Ausgabe |
| Sprachfassung | Englisch |
| Seiten | 530 |
| Erscheinungsdatum | 2026-04-28 |
| Verlag | APRESS |
Produktkennung
| Artikelnummer | m0000THZJS |
| EAN | 9798868822964 |
| GTIN | 09798868822964 |
Zusatzinfo und Downloads
Details zur Produktsicherheit
| Herstellerinformationen |
| Verantwortliche Person für die EU |
| Entsorgungshinweise |
Produktdetails
This book is a complete, hands-on guide to designing, training, and deploying your own Large Language Models (LLMs)—from the foundations of tokenization to the advanced stages of fine-tuning and reinforcement learning. Written for developers, data scientists, and AI practitioners, it bridges core principles and state-of-the-art techniques, offering a rare, transparent look at how modern transformers truly work beneath the surface. Starting from the essentials, you’ll learn how to set up your environment with Python and PyTorch, manage datasets, and implement critical fundamentals such as tensors, embeddings, and gradient descent. You’ll then progress through the architectural heart of modern models, covering RMS normalization, rotary positional embeddings (RoPE), scaled dot-product attention, Grouped Query Attention (GQA), Mixture of Experts (MoE), and SwiGLU activations, each explored in depth and built step by step in code. As you advance, the book introduces custom CUDA kernel integration, teaching you how to optimize key components for speed and memory efficiency at the GPU level—an essential skill for scaling real-world LLMs. You’ll also gain mastery over the phases of training that define today’s leading models: Pretraining - Building general linguistic and semantic understanding. Midtraining - Expanding domain-specific capabilities and adaptability. Supervised Fine-Tuning (SFT) - Aligning behavior with curated, task-driven data. Reinforcement Learning from Human Feedback (RLHF) - Refining responses through reward-based optimization for human alignment. The final chapters guide you through dataset preparation, filtering, deduplication, and training optimization, culminating in model evaluation and real-world prompting with a custom TokenGenerator for text generation and inference. By the end of this book, you’ll have the knowledge and confidence to architect, train, and deploy your own transformer-based models, equipped with both the theoretical depth...
Infotabelle
Produktspezifikationen
| Autor | Dilyan Grigorov |
| Format | gebundene Ausgabe |
| Sprachfassung | Englisch |
| Seiten | 530 |
| Erscheinungsdatum | 2026-04-28 |
| Verlag | APRESS |
Produktkennung
| Artikelnummer | m0000THZJS |
| EAN | 9798868822964 |
| GTIN | 09798868822964 |
Zusatzinfo und Downloads
Details zur Produktsicherheit
| Herstellerinformationen |
| Verantwortliche Person für die EU |
| Entsorgungshinweise |
Top Produkte der Kategorie
Weitere Kategorien
Bücher, Musik & Filme Bücher Fachbücher Informatik Geschichtswissenschaft Recht Theologie Psychologie Politikwissenschaft Wirtschaft Medienwissenschaft Ethnologie Philosophie Technik Sozialwissenschaft Pädagogik Sprach- & Literaturwissenschaft Mathematik Biowissenschaften Allgemeine Naturwissenschaften Allgemeine Geisteswissenschaften Physik Geowissenschaften Musikwissenschaft Kunstwissenschaft Chemie Medizin











