The Large Language Models Bootcamp covers the entire stack of building enterprise LLMs, from fundamentals to advanced RAG techniques and fine-tuning. It is designed to equip attendees, regardless of their background, with the skills to build custom LLMs within their organizations, and it includes training-ready infrastructure and a project with mentorship support.
Bootcamp Overview
• 00:00:08 Data Science Dojo offers a Large Language Models Bootcamp, billed as the first in the industry, for building enterprise applications using LLMs and generative AI. It covers all mainstream tools and packages through practical exercises and culminates in a mentored project.
LLM Challenges
• 00:01:27 Building enterprise-level LLMs presents numerous challenges, including regulatory and data governance issues related to intellectual property and personally identifiable information (PII). Further challenges include managing context windows, token usage, cost-efficiency, response latency, and hallucination, as well as the difficulty of defining what counts as a correct response from an LLM.
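To make the context-window and cost constraints concrete, here is a minimal sketch of the arithmetic involved. The class, the function names, the 8,000-token window, and the prices are all hypothetical and are not taken from the bootcamp; real limits and pricing vary by model and provider.

```python
from dataclasses import dataclass


@dataclass
class ModelLimits:
    """Hypothetical limits and prices; real values vary by model and provider."""
    context_window: int        # maximum tokens for prompt plus completion
    usd_per_1k_input: float    # assumed price per 1,000 prompt tokens
    usd_per_1k_output: float   # assumed price per 1,000 completion tokens


def fits_context(limits, prompt_tokens, max_completion_tokens):
    """Return True if the prompt plus the reserved completion fits in the window."""
    return prompt_tokens + max_completion_tokens <= limits.context_window


def estimate_cost(limits, prompt_tokens, completion_tokens):
    """Return the estimated cost in USD of a single request."""
    return (prompt_tokens / 1000 * limits.usd_per_1k_input
            + completion_tokens / 1000 * limits.usd_per_1k_output)


# Illustrative numbers only.
limits = ModelLimits(context_window=8000, usd_per_1k_input=0.01, usd_per_1k_output=0.03)
print(fits_context(limits, prompt_tokens=7000, max_completion_tokens=500))  # True
print(fits_context(limits, prompt_tokens=7800, max_completion_tokens=500))  # False
print(round(estimate_cost(limits, prompt_tokens=2000, completion_tokens=500), 4))  # 0.035
```

The same two checks, run before each request, are one simple way to catch prompts that would overflow the window and to track spend per call.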
LLM Application Architecture
• 00:10:10 The core of an LLM application is the language model itself, hosted either on-premise or in the cloud, but successful development requires extensive software engineering expertise. Other vital components include vector databases for search indexing, embedding models for semantic search, LangChain for prompt templates and multi-agent systems, and observability/guardrails for monitoring and ensuring ethical usage.
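The way these components fit together can be sketched with toy stand-ins. In the example below, a bag-of-words counter plays the role of the embedding model, a Python list plays the role of the vector database, and a format string plays the role of the prompt template. None of this is LangChain's API or the bootcamp's code; every name here is an assumption made for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryIndex:
    """Toy stand-in for a vector database: stores (vector, text) pairs."""

    def __init__(self):
        self.items = []

    def add(self, text):
        self.items.append((embed(text), text))

    def search(self, query, k=1):
        """Return the k stored texts most similar to the query."""
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[0]), reverse=True)
        return [text for _, text in ranked[:k]]


# Hypothetical prompt template; the wording is illustrative.
PROMPT_TEMPLATE = "Answer using only this context:\n{context}\n\nQuestion: {question}"


def build_prompt(index, question, k=1):
    """Retrieve context for the question and fill the prompt template."""
    context = "\n".join(index.search(question, k=k))
    return PROMPT_TEMPLATE.format(context=context, question=question)


index = InMemoryIndex()
index.add("Refunds are processed within 14 days of the request.")
index.add("Our office is closed on public holidays.")
print(build_prompt(index, "How long do refunds take?"))
```

The resulting prompt would then be sent to the language model. In a real application the counter would be replaced by a learned embedding model and the list by a vector database, with observability and guardrails wrapped around the model call.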
Bootcamp Curriculum
• 00:17:12 The 5-day bootcamp begins with an overview of the LLM ecosystem, prompt engineering, and fundamental concepts like discriminative and generative learning. It progressively covers attention mechanisms, Transformer architecture, vector databases, and different search methods. The curriculum also delves into LangChain, fine-tuning, and evaluation techniques, culminating in a project to build a custom LLM application.
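As an illustration of the attention mechanism at the heart of the Transformer architecture, here is a minimal sketch of single-head scaled dot-product attention in plain Python. It leaves out the learned projections, masking, and multiple heads that a full Transformer uses, and it is not taken from the bootcamp material.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors.

    Returns (outputs, weights), where weights[i][j] is how strongly
    query i attends to key j.
    """
    d_k = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        w = softmax(scores)
        weights.append(w)
        # The output is the attention-weighted average of the value vectors.
        outputs.append([sum(wj * v[c] for wj, v in zip(w, values))
                        for c in range(len(values[0]))])
    return outputs, weights


queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]
outputs, weights = attention(queries, keys, values)
print(weights[0])   # the first key gets the larger weight
print(outputs[0])   # so the output leans toward the first value vector
```

Because the query matches the first key more closely, the softmax assigns it more weight, and the output is pulled toward the first value vector.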
Prerequisites & Cost
• 00:16:13 The bootcamp is primarily Python-based and features a two-to-three-hour high-speed Python tutorial for those who need it. The next bootcamp starts on December 2nd, with in-person and remote options available. The cost is typically $5,000, but may be reduced to $3,700 or $4,000 depending on availability and enrollment timing.