SplitQuantV2：无需GPU即可增强大型语言模型的低比特量化

SplitQuantV2：无需GPU即可增强大型语言模型的低比特量化
SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

arXivLabs是一个框架，允许合作者直接在我们的网站上开发和分享新的arXiv功能。与arXivLabs合作的个人和组织都已接受并认同我们开放、社区、卓越和用户数据隐私的价值观。arXiv 致力于这些价值观，并且只与遵守这些价值观的合作伙伴合作。是否有能为arXiv社区增值的项目创意？了解更多关于arXivLabs的信息。

Hacker News 最新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 SplitQuantV2：无需GPU即可增强大型语言模型的低比特量化 (arxiv.org) 10 分，来自 PaulHoule，3 小时前 | 隐藏 | 过去 | 收藏 | 讨论加入我们，参加 6 月 16-17 日在旧金山举办的 AI 初创企业学校！指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系我们搜索：

通过多标记预测更好更快的大型语言模型 2024-05-02

LLM 量化可视化指南 2024-07-31

SmolDocling：一款用于端到端多模态文档转换的超紧凑型大型语言模型 2025-03-21

在最先进的法学硕士中展示推理失败的简单任务 2024-06-06

原文

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

SplitQuantV2：无需GPU即可增强大型语言模型的低比特量化 SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

SplitQuantV2：无需GPU即可增强大型语言模型的低比特量化
SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs