Retrieval Augmented Generation Based on SQLite (github.com)
87 points by emzo 5 days ago | 12 comments
  • anoojb 4 days ago

    Would love to see a system that combines cheap lexical (full-text) search and semantic/vector search in SQLite, choosing the best approach for the given input.
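
A minimal sketch of the kind of blend described above, assuming Python's standard `sqlite3` module was built with FTS5. It fuses the two rankings with reciprocal rank fusion rather than picking a single approach per query. The corpus, the 2-d "embeddings", and all function names are hypothetical and made up for illustration; a real system would get its vectors from an embedding model and would likely store them through an extension.

```python
import math
import sqlite3

# Toy corpus and made-up 2-d "embeddings" (illustrative assumptions only).
DOCS = {
    1: "SQLite full-text search with FTS5",
    2: "Vector embeddings capture semantic similarity",
    3: "Hybrid search blends keyword and vector results",
}
VECS = {1: (1.0, 0.0), 2: (0.0, 1.0), 3: (0.7, 0.7)}


def build_index():
    """Create an in-memory FTS5 table holding the toy corpus."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE docs USING fts5(body)")
    conn.executemany("INSERT INTO docs(rowid, body) VALUES (?, ?)", DOCS.items())
    return conn


def lexical_rank(conn, query):
    """Doc ids matching the query, best BM25 match first."""
    rows = conn.execute(
        "SELECT rowid FROM docs WHERE docs MATCH ? ORDER BY rank", (query,)
    )
    return [rowid for (rowid,) in rows]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def vector_rank(query_vec):
    """All doc ids, most similar toy embedding first (brute force)."""
    return sorted(VECS, key=lambda d: cosine(VECS[d], query_vec), reverse=True)


def rrf(rankings, k=60):
    """Reciprocal rank fusion: each list contributes 1 / (k + position)."""
    scores = {}
    for ranking in rankings:
        for pos, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + pos)
    return sorted(scores, key=scores.get, reverse=True)


def hybrid_search(conn, query, query_vec):
    return rrf([lexical_rank(conn, query), vector_rank(query_vec)])
```

Documents that appear in both rankings accumulate score from each list, so a result that is decent on both lexical and semantic grounds can outrank one that is strong on only one.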

    • bob1029 4 days ago | parent

      If you want the best possible solution vertical for most businesses, I'd be looking at using Lucene for FTS duty.

      Having the FTS engine provide a Google-style snippet of the most relevant document chunk is the holy grail for RAG applications. Lucene does this kind of thing better than anyone else:

      https://lucene.apache.org/core/8_0_0/highlighter/org/apache/...

      It is also very easy to customize this engine and align the document tokenization & indexing concerns with your specific retrieval scenarios.

  • wredcoll 4 days ago

    This looks cool, and I'm interested in these keywords, but I read the entire readme and I'm still unsure what problem it's actually solving.

    Anyone want to help out?

    • webstrand 4 days ago | parent

      This is for LLMs. In general RAG takes a user prompt and uses it to find potentially relevant documents in the database. It then enriches the original prompt with those documents so that the LLM has context that wasn't in its training dataset.
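
The flow described above can be sketched in a few lines. This is a minimal illustration, not the linked project's implementation: the word-overlap scoring is a stand-in for a real retriever, and `retrieve`, `build_prompt`, and the prompt wording are hypothetical names and choices made for this example.

```python
def retrieve(question, documents, top_k=2):
    """Rank documents by how many words they share with the question.

    Word overlap is a placeholder for a real retriever (full-text or vector).
    """
    q_words = set(question.lower().split())
    ranked = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, documents, top_k=2):
    """Enrich the user's question with the retrieved documents as context."""
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents, top_k))
    return (
        "Use the context to answer.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

The string returned by `build_prompt` is what would be sent to the LLM in place of the bare question.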

    • Octplane 4 days ago | parent

      RAG -> vector search: your documents are indexed not as full text but as vectorized objects, which means you can search using concepts instead of the exact strings you would use with a regular full-text search.

      This makes the search less precise and more powerful at the same time (i.e., it can look clever to some extent).
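
A toy illustration of that difference, assuming hand-written 3-d vectors in place of real embeddings (every number and name below is invented for the example). The exact-string search finds nothing for "car", while the nearest-vector search still lands on the related document.

```python
import math

# Hypothetical embeddings; a real system would compute these with a model.
EMBEDDINGS = {
    "automobile repair manual": (0.9, 0.1, 0.0),
    "banana bread recipe": (0.0, 0.2, 0.9),
}
QUERY_TEXT = "car"
QUERY_VEC = (0.8, 0.2, 0.1)  # assumed embedding of the word "car"


def exact_matches(query, docs):
    """Plain substring search: only literal occurrences count."""
    return [doc for doc in docs if query in doc]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def nearest(query_vec, embeddings):
    """Return the document whose vector points most nearly the same way."""
    return max(embeddings, key=lambda doc: cosine(embeddings[doc], query_vec))
```

The vector search always returns something, even when nothing is truly relevant, which is one way to read "less precise" in the comment above.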

    • ncruces 4 days ago | parent

      This might provide context: https://alexgarcia.xyz/blog/2024/sqlite-vec-hybrid-search/in...

  • 3 days ago
    [deleted]
  • rcarmo 4 days ago

    This and sqlite-vec (or whatever extension is trendy these days) can do a lot with a very limited amount of compute.

  • 4 days ago
    [deleted]
  • almosthere 4 days ago

    SQLite has embedding search? Or is that being provided by this tool?

    • Octplane 4 days ago | parent

      It's provided via https://github.com/asg017/sqlite-vec

    • ethan_smith 4 days ago | parent

      SQLite itself doesn't have native embedding search, but extensions like sqlite-vss and sqlite-vectorize add vector similarity search capabilities to SQLite.