（评论）

（评论）
(comments)

原始链接: https://news.ycombinator.com/item?id=40252569

用户基于 MLC AI 的 WebLLM 项目开发了一个名为“Secret Llama”的改进聊天界面，用于直接通过 Web 浏览器访问大型语言模型，例如 Mistral 7B 和 Llama 3，无需额外的依赖项或安装。这个新界面通过在本地存储用户数据来增强安全性。用户将其用作心理健康顾问和激励指南，称赞其便利性和隐私优势。下载的模型会缓存在浏览器中，以便重复访问时的响应速度更快。 Secret LLama 项目可在 GitHub 上根据 Apache 2.0 许可证公开访问，欢迎贡献，特别是在加快初始加载过程方面。提供了其代码库和演示网站的链接。【总结完】某人利用 WebLLM 项目创建了一个基于浏览器的高级聊天界面，名为“Secret Llama”。他们增强了该系统，以便在单个选项卡中提供与大型语言模型（包括 Mistral 7B 和 Llama 3）的无缝交互，而无需额外的软件。新界面通过将所有个人数据限制在用户自己的计算机上来确保增强的隐私性。事实证明，它作为治疗工具和激励教练非常有用。每次连续访问时，由于模型被缓存在 IndexedDB 中，加载速度会更快。这项开源计划在 Apache 2.0 许可证下发布，欢迎协作努力，特别是那些在初始加载过程中解决性能改进问题的努力。有关更多信息和贡献机会，请参阅给定的链接。

I spent the last few days building out a nicer ChatGPT-like interface to use Mistral 7B and Llama 3 fully within a browser (no deps and installs).

I’ve used the WebLLM project by MLC AI for a while to interact with LLMs in the browser when handling sensitive data but I found their UI quite lacking for serious use so I built a much better interface around WebLLM.

I’ve been using it as a therapist and coach. And it’s wonderful knowing that my personal information never leaves my local computer.

Should work on Desktop with Chrome or Edge. Other browsers are adding WebGPU support as well - see the Github for details on how you can get it to work on other browsers.

Note: after you send the first message, the model will be downloaded to your browser cache. That can take a while depending on the model and your internet connection. But on subsequent page loads, the model should be loaded from the IndexedDB cache so it should be much faster.

The project is open source (Apache 2.0) on Github. If you like it, I’d love contributions, particularly around making the first load faster.

Github: https://github.com/abi/secret-llama Demo: https://secretllama.com

（评论） (comments)

（评论）
(comments)