| ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
![]() |
原始链接: https://news.ycombinator.com/item?id=43694546
Cohere发布了新的嵌入模型Embed 4,并被赞誉为潜在的最佳模型。然而,用户simonw对依赖专有、仅API的模型持保留意见,尤其是在存在强大的开源替代方案的情况下。他称赞Nomic的做法:通过API或开源权重提供模型以供非商业用途,之后再根据Apache 2.0许可,从而确保模型的长期可用性。 另一位用户lukebuehler强调了多模态嵌入选项的匮乏,特别是文本和图像的结合,他认为Embed 4是一个受欢迎的补充。他还指出了谷歌模型的限制,即其仅支持30个文本token。 第三位用户moojacob指出,在基准测试中,Embed 4的表现不如voyage-3-large,并质疑了嵌入模型基准测试的整体效用。moralestapia承认该模型价格昂贵,但强调其基准测试结果很有前景。
| ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
![]() |
I really like the approach Nomic take: their most recent models are available via their API or as open weights for non-commercial use only (unless you buy a license). They later relicense their older models under Apache 2.0 licenses.
This gives me confidence that I can continue to use my calculated vectors in the future even if Nomic's model is no longer available because I can run the local one instead.
Nomic Embed Vision 1.5 for example started out as CC-BY-NC-4.0 but was later relicensed to Apache 2.0: https://www.nomic.ai/blog/posts/nomic-embed-vision
reply