面向产品构建基于RAG的LLM应用

1,107次阅读

详细介绍了如何从头开始构建一个基于检索增强生成(RAG)的大型语言模型(LLM)应用。

主要步骤包括：

加载数据、分割文本、嵌入数据、索引数据、检索相关文本块、生成回复。
为了扩展应用，实现了在Ray Data上进行并行计算的功能。
为评估不同系统配置，实现了组件级评估和端到端评估。
比较了不同的文本块大小、块数、嵌入模型和LLM的性能。
实现了查询路由，根据查询复杂性将其发送到合适的LLM。
使用Ray Serve架构应用，实现弹性伸缩。
讨论了LLM应用的一阶和二阶影响。
提出后续工作，包括持续更新、微调嵌入模型和LLM、收集用户反馈等。
强调了Ray和Anyscale如何帮助构建、扩展和产品化LLM应用。

GitHub: github.com/ray-project/llm-applications
Notebook: github.com/ray-project/llm-applications/blob/main/notebooks/rag.ipynb

正文完

可以使用微信扫码关注公众号（ID：xzluomor）

AI AR HTML RSS 产品大型语言模型架构

发表至：智源

2023-09-14

小冰正式发布克隆人：已经有人拿它年入100万了！

OpenAGI:当大语言模型遇到领域专家

博士论文 | 用于计算机辅助药物发现的机器学习方法 309页

文心4.0加持、0代码开发，自带流量的智能体平台来了！

Stability最新发布的生成式音频模型Stable Audio

卡内基梅隆大学计算机学院推出「机器人」本科学位

评论（没有评论）

文章搜索

最新评论

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

热评文章

Generated by Feedzy

面向产品构建基于RAG的LLM应用

超越DeepSeek-R1，数学形式化准确率飙升至84% | 字节&南大开源

开源Qwen一周连刷三冠，暴击闭源模型！基础模型推理编程均SOTA

这个5亿播放的AI视频，邪乎得平平无奇

TRAE推出SOLO模式，业内首个「Context Engineer」来了

B站亮相2025世界人工智能大会，发布最受年轻人关注的TOP30 AI应用

刘强东连投3家具身智能！京东美团「战火」烧到外卖之外

3亿美元薪酬被10人拒绝！OpenAI首席研究官一句话引发硅谷史上最疯狂抢人大战

蚂蚁ACL活动全览！论文串讲、人才专项答疑与闭门晚宴等你报名

手术刀式去噪突破LLM能力上限，从头预训练模型下游任务平均提高7.2% | 中科院＆阿里

IMO怒斥OpenAI自封夺金，“91位评委均未参与评分”