语音和语言处理 (第三版草稿)
Speech and Language Processing (3rd ed. draft)

原始链接: https://web.stanford.edu/~jurafsky/slp3/

《语音与语言处理》(第三版)将于2025年8月24日发布,其中包含重大更新和重组。主要新增内容包括使用DPO进行偏好对齐,关于Whisper的自动语音识别(ASR)新材料,以及关于EnCodec和VALL-E的文本到语音(TTS)。 本书的结构已修订,以优先考虑现代技术:逻辑回归现在介绍分类,并且LLM在Transformer架构*之前*进行讲解。响应学生反馈,RNN/LSTM章节被推迟,从而采用Transformer优先的方法。 关于对话和聊天机器人的内容已整合到LLM和对话结构章节中,以反映当前趋势。第二章经过修改,重点关注token和Unicode。已修复了大量错别字,并通过[email protected]提供草稿以供反馈。作者对社区贡献表示感谢,这些贡献对本书的开发至关重要。

黑客新闻新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 语音和语言处理 (第三版草稿) (stanford.edu) 5 分,atomicnature 发表于 1 小时前 | 隐藏 | 过去 | 收藏 | 讨论 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请YC | 联系 搜索:
相关文章

原文

Here's our August 24, 2025 release!

This release has

  • preference alignment with DPO in the posttraining Chapter 9
  • completely new ASR (Whisper) and TTS (EnCodec and VALL-E) material in Chapter 15 and 16
  • a restructuring of earlier chapters to fit how we are teaching now:
    • move Naive Bayes to the Appendix and instead using Logistic Regression to teach about classification
    • Moving PPMI to the appendix and tf-idf only in Chapter 11, to move more quickly through sparse vectors
  • the concept of LLMs, and LLM sampling and training introduced in chapter 7, before introducing the internals with the transformer in Chapter 8.
  • RNN/LSTM chapter delayed to 13, because students have asked to go directly to Transformers without first learning RNNs. The new structure allows either order (LSTM/Transformer or Transformer/LSTM).
  • a restructured Chapter 2 to focus more on tokens and words and introduce Unicode.
  • typo fixes (thanks again to all of you!)
  • some new slides
  • The dialogue and chatbot Chapter was divided up and folded into various other chapters, now that LLMs tend to have replaced most earlier chatbot architectures. Much of the introduction and the ethics section went into the LLM chapter. The summary of human conversational structure went to the new chapter 25 "Conversation and its structure". The frame-based dialogue agents section is currently in Appendix chapter J, although that may change.
Individual chapters and updated slides are below.

Here is a single pdf of Aug 24, 2025 book!

  1. Feel free to use the draft chapters and slides in your classes, print it out, whatever, the resulting feedback we get from you makes the book better!
  2. Typos and comments are very welcome (just email [email protected] and let us know the date on the draft)! (Don't bother reporting missing refs due to cross-chapter cross-reference problems in the indvidual chapter pdfs, those are fixed in the full book draft)
  3. Gratitude! We've put up a list here of the amazing people who have sent so many fantastic suggestions and bug-fixes for improving the book. We are really grateful to all of you for your help, the book would not be possible without you!
  4. How to cite the book:

    Daniel Jurafsky and James H. Martin. 2025. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition with Language Models, 3rd edition. Online manuscript released August 24, 2025. https://web.stanford.edu/~jurafsky/slp3.

  5. A bib entry for the book is here.
    @Book{jm3,
      author =       "Daniel Jurafsky and James H. Martin",
      title =        "Speech and Language Processing: An Introduction to Natural Language Processing, 
      		  Computational Linguistics, and Speech Recognition,
    		   with Language Models",
      year =         "2025",
      url = {https://web.stanford.edu/~jurafsky/slp3/},
      note = "Online manuscript released August 24, 2025",
      edition =         "3rd",
      }
    
  6. When will the book be finished? Don't ask.
  7. If you need the previous Jan 2025 draft chapters, they are here; if you need the previous Aug 2024 draft chapters, they are here;
联系我们 contact @ memedata.com