<?xml version="1.0" encoding="UTF-8"?>
<mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" version="3.1" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
  <titleInfo>
    <title>Hands-on large language models</title>
    <subTitle>language understanding and generation</subTitle>
  </titleInfo>
  <name type="personal">
    <namePart>Alammar, Jay</namePart>
    <role>
      <roleTerm authority="marcrelator" type="text">creator</roleTerm>
    </role>
    <role>
      <roleTerm type="text">Author.</roleTerm>
    </role>
  </name>
  <name type="personal">
    <namePart>Grootendorst, Maarten</namePart>
    <role>
      <roleTerm type="text">Joint author.</roleTerm>
    </role>
  </name>
  <typeOfResource>text</typeOfResource>
  <genre authority="marc">bibliography</genre>
  <originInfo>
    <place>
      <placeTerm type="code" authority="marccountry">cau</placeTerm>
    </place>
    <place>
      <placeTerm type="text">Navi Mumbai</placeTerm>
    </place>
    <publisher>Shroff Publishers and Distributors Pvt. Ltd.</publisher>
    <dateIssued>2024</dateIssued>
    <copyrightDate encoding="marc">2024</copyrightDate>
    <issuance>monographic</issuance>
  </originInfo>
  <language>
    <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
  </language>
  <physicalDescription>
    <form authority="marcform">print</form>
    <extent>xix, 403 pages : illustrations (some color) ; 24 cm</extent>
  </physicalDescription>
  <abstract>AI has acquired startling new language capabilities in just the past few years.  Driven by rapid advances in deep learning, language AI systems are able to write and understand text better than ever before.  This trend is enabling new features, products, and entire industries.  Through his book's visually educational nature, readers will learn practical tools and concepts they need to use these capabilities today.  You'll understand how to use pretrained language models for use cases like copywriting and summarization; create semantic search systems that go beyond keyword matching; and use existing libraries and pretrained models for text classification, search, and clusterings.  This book also helps you: Understand the architecture of transformer language models that excel at text generation and representation ; Build advanced LLM pipelines to cluster text documents and explore the topics they cover ; Build semantic search engines that go beyond keyword search, using methods like dense retrieval and rerankers ; Explore how generative models can be used, from prompt engineering all the way to retrieval-augmented generation ; Gain a deeper understanding of how to train LLMs and optimize them for specific applications using generative model fine-tuning, contrastive fine-tuning, and in-context learning.</abstract>
  <tableOfContents>Part 1. Understanding language models.  An introduction to Large Language Models -- Tokens and embeddings -- Looking inside Large Language Models -- Part 2. Using pretrained language models.  Text classification -- Text clustering and topic modeling -- Prompt engineering -- Advanced text generation techniques and tools -- Semantic search and retrieval-augmented generation -- Mulitimodal Large Language Models -- Part 3. Training and fine-tuning language models.  Creating text embedding models -- Fine-tuning representation models for classification -- Fine-tuning generation models.</tableOfContents>
  <note type="statement of responsibility">Jay Alammar and Maarten Grootendorst.</note>
  <note>Includes bibliographical references and index.</note>
  <subject authority="lcsh">
    <topic>Natural language generation (Computer science)</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Artificial intelligence</topic>
    <topic>Computer programs</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Machine learning</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Software engineering</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Artificial intelligence</topic>
    <topic>Engineering applications</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Generative programming (Computer science)</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Application software</topic>
    <topic>Development</topic>
  </subject>
  <subject authority="rvm">
    <topic>Intelligence artificielle</topic>
    <topic>Logiciels</topic>
  </subject>
  <classification authority="ddc">006.35 ALA/H</classification>
  <identifier type="isbn">9789355425522 (pbk.)</identifier>
  <recordInfo>
    <recordContentSource authority="marcorg">IN-BhIIT</recordContentSource>
    <recordCreationDate encoding="marc">250503</recordCreationDate>
    <recordChangeDate encoding="iso8601">20260508172028.0</recordChangeDate>
    <recordIdentifier source="IN-BhIIT">TB12526</recordIdentifier>
  </recordInfo>
</mods>
