Long, expensive, awesome

The image was generated by Midjourney
Published on November 21, 2024

Disclaimer:
This text is sourced from Reddit and posted here with the author’s permission and blessing. The author requested to remain anonymous and to be referred to only by username: LegSubstantial2624. Source - Reddit link. Let this touch of humor offer a bit of comfort on your challenging journey toward mastering RAG and building a RAG system.

I know exactly how to build an awesome RAG. It’s as easy as pie.

First, prepare your data. Use some cool vision quirk with the hi_res option. Your, oh, about 400 PDF files will be processed in just a week or so, maybe a bit more… No biggie.

Make sure to use some smart chunking. Something semantic, with embeddings from OpenAI. I mean, come on, even a kid knows that.
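For the curious, here is a minimal sketch of what "something semantic" could look like: start a new chunk whenever two neighboring sentences stop resembling each other. Everything in it is an assumption made for illustration. `toy_embed` is a bag-of-words stand-in so the example runs offline (a real setup would call an embedding model instead), and the `0.2` threshold is arbitrary.

```python
import math
import re
from collections import Counter


def toy_embed(sentence: str) -> Counter:
    """Stand-in for a real embedding call: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(sentences: list[str], threshold: float = 0.2) -> list[list[str]]:
    """Start a new chunk when adjacent sentences are less similar than threshold."""
    if not sentences:
        return []
    chunks = [[sentences[0]]]
    for prev, cur in zip(sentences, sentences[1:]):
        if cosine(toy_embed(prev), toy_embed(cur)) < threshold:
            chunks.append([cur])      # topic shift: open a new chunk
        else:
            chunks[-1].append(cur)    # same topic: extend the current chunk
    return chunks


sentences = [
    "The cat sat on the mat.",
    "The cat chased the mouse.",
    "Stock prices fell sharply today.",
    "Stock markets closed lower today.",
]
chunks = semantic_chunks(sentences)
```

With these four sentences the split lands between the cats and the stocks. Swapping `toy_embed` for a paid embedding API is what turns this from free into awesome.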

But! Data prep doesn't stop there! You want it awesome, right? Every chunk needs to go through some LLM magic. Analyze it, enrich it so that every chunk is like Scrooge McDuck diving into his money bin. Keywords, summarization, all that jazz. Pick a pricey LLM, don’t be stingy. You want awesome, don’t you?

Ok, now for search. Simple stuff. Every query needs to be rephrased by an LLM, like, 5-7 times, maybe 10. Less is pointless. So - each query will give you 10 new ones, but what a bunch!

Then, take them all into vector search. And the results? You guessed it! Straight into Cohere reranker! We’re going for awesome, remember? Don’t forget to merge the results.
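The post does not say how to merge the results, so the sketch below swaps in one common choice, reciprocal rank fusion (RRF). It assumes each rephrased query has already produced a ranked list of document ids. The name `merge_ranked_lists` and the constant `k = 60` are illustrative assumptions, not something taken from the post.

```python
from collections import defaultdict


def merge_ranked_lists(result_lists: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document ids with reciprocal rank fusion."""
    scores: dict[str, float] = defaultdict(float)
    for results in result_lists:
        for rank, doc_id in enumerate(results, start=1):
            # A document earns more credit the higher it ranks in each list.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical ranked results for three rephrasings of one query.
per_query_results = [
    ["a", "b", "c"],
    ["b", "a", "d"],
    ["b", "c", "a"],
]
merged = merge_ranked_lists(per_query_results)
```

Because RRF uses only ranks, it also deduplicates documents that show up under several rephrasings, which with 10 rephrasings per query is most of them.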

And now, for the final touch - LLM on the output. Here is my suggestion: pick a few models, let each one do its job. Then, use yet another model to pick the best one. Or, you know, whichever…
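A rough sketch of that final touch, under loud assumptions: the "models" here are plain Python functions standing in for LLM API calls, and the judge is a toy that prefers the longest answer. All names (`answer_with_ensemble`, `terse_model`, `chatty_model`, `longest_answer_judge`) are hypothetical.

```python
from typing import Callable

Answerer = Callable[[str], str]
Judge = Callable[[str, list[str]], int]


def answer_with_ensemble(question: str, answerers: list[Answerer], judge: Judge) -> str:
    """Ask every answerer, then let the judge return the index of the winner."""
    candidates = [answer(question) for answer in answerers]
    winner = judge(question, candidates)
    if not 0 <= winner < len(candidates):
        raise ValueError(f"judge returned invalid index {winner}")
    return candidates[winner]


# Hypothetical stand-ins; real ones would each wrap a (pricey) LLM call.
def terse_model(question: str) -> str:
    return "42"


def chatty_model(question: str) -> str:
    return "After careful consideration, the answer is 42."


def longest_answer_judge(question: str, candidates: list[str]) -> int:
    """Toy judge: picks the longest candidate, because longer is more awesome."""
    return max(range(len(candidates)), key=lambda i: len(candidates[i]))


best = answer_with_ensemble(
    "What is the answer?", [terse_model, chatty_model], longest_answer_judge
)
```

Each answer costs one call per model plus one for the judge, which fits the budget philosophy nicely.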

And the most important rule - no open source, only proprietary, only hardcore!

P.S. Under every Reddit post, there’s always a comment saying, “Clearly, this post was written by ChatGPT.” Don’t bother. This post was entirely crafted by ChatGPT, no humans involved.

LegSubstantial2624
Copyright © 2024, QuePasa.ai.
All rights reserved.