Hacker News | nutellalover's comments

Codeviz is such a cool project. Love having it in VSCode.


Glad you like it! Let us know if you have any suggestions


Big fans of Cursor ourselves. One of the goals with this library is to make it easy for maintainers of OSS projects to expose chat support functionality to their users in a very streamlined, easy-to-setup fashion.

So yes, you can certainly use it to index and query your own repos for yourself, but it's also a way to onboard more of your OSS library's users.
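To make the index-then-query flow concrete, here is a purely illustrative toy (repo2vec's real API may differ, and the class and method names below are hypothetical): index a repo's files once, then answer questions by retrieving the most relevant files.

```python
# Illustrative only: a stand-in for embedding-based retrieval, showing
# the general shape of "index your repo, then chat over it".

class ToyRepoIndex:
    def __init__(self):
        self._docs = {}  # path -> file contents

    def index(self, files):
        """'Index' a mapping of {path: source text}."""
        self._docs.update(files)

    def query(self, question, top_k=2):
        """Return the top_k paths whose contents share the most words
        with the question (a crude proxy for semantic similarity)."""
        q_words = set(question.lower().split())
        scored = sorted(
            self._docs.items(),
            key=lambda kv: len(q_words & set(kv[1].lower().split())),
            reverse=True,
        )
        return [path for path, _ in scored[:top_k]]

index = ToyRepoIndex()
index.index({
    "auth.py": "def login(user, password): check credentials",
    "db.py": "def connect(url): open a database connection",
})
print(index.query("which file does the login credentials check"))
```

A real implementation would swap the word-overlap scoring for embedding similarity, but the two-step shape (index once, query many times) is the same.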


Great question. For most small repos (10-20 source files) this works incredibly well out-of-the-box.

We stress-tested with repos like langchain, llamaindex, and kubernetes; at that scale, retrieval still needs work to return relevant chunks effectively. This is still an open research question.


The truth is we started there. But for any reasonably sized, complex codebase this just isn't going to work: the context window isn't sufficient, and it becomes harder for the LLM to reason over arbitrary parts of the context.

For the time being, indexing the codebase and retrieving a good collection of 10-20 code chunks is more effective and performant in practice.
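The retrieve-then-read idea above can be sketched in a few lines. This is a minimal stand-in, not the library's actual retriever: the bag-of-words "embedding" here substitutes for a real embedding model, but the top-k cosine-similarity selection is the same pattern.

```python
# Instead of stuffing the whole repo into the context window, embed each
# code chunk and hand the LLM only the k most similar chunks.
import math
from collections import Counter

def embed(text):
    # Toy embedding: a sparse bag-of-words vector.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_k_chunks(query, chunks, k=2):
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

chunks = [
    "def parse_config(path): read yaml settings",
    "def render_chart(data): draw a bar chart",
    "def load_settings(): parse config from disk",
]
print(top_k_chunks("where is the config parsed", chunks, k=2))
```

With a real model, `embed` would return dense vectors, but the ranking and truncation to k chunks would look identical.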


This is a great read Simon.


Definitely agree that the trend is toward lower cost, which unlocks a lot of these use cases. Especially as all the major third-party LLM providers scramble to ship better models to retain mind-share.


We ran some qualitative tests and there was a quality difference. In fact, benchmarks show that trend to generally hold: https://archersama.github.io/coir/

That being said, our goal was to make the library modular so you can easily add support for whatever embeddings you want. Definitely encourage experimenting for your use-case because even in our tests, we found that trends which hold true in research benchmarks don't always translate to custom use-cases.
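The kind of modularity described above usually boils down to a small embedder interface you can swap implementations behind. A sketch of that pattern follows; the class and method names are hypothetical, not repo2vec's actual API, and the two toy embedders exist only to show that swapping is a one-line change.

```python
# A minimal pluggable-embedder interface: any class implementing
# embed() can be dropped into the indexing pipeline.
from abc import ABC, abstractmethod

class Embedder(ABC):
    @abstractmethod
    def embed(self, texts):
        """Return one vector (list of floats) per input text."""

class CharLengthEmbedder(Embedder):
    """Toy local embedder: a single feature, the text length."""
    def embed(self, texts):
        return [[float(len(t))] for t in texts]

class WordCountEmbedder(Embedder):
    """Another drop-in implementation with the same interface."""
    def embed(self, texts):
        return [[float(len(t.split()))] for t in texts]

def build_index(embedder, chunks):
    # Pair each chunk with its vector; storage backend omitted.
    return list(zip(chunks, embedder.embed(chunks)))

# Swapping embedding models is a one-line change:
index = build_index(WordCountEmbedder(), ["def f(): pass", "x = 1"])
```

Keeping the interface this narrow is what makes it cheap to benchmark several embedding models against your own repos.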


> we found that trends which hold true in research benchmarks don't always translate to custom use-cases.

Exactly why I asked! If you don't mind a followup question, how were you evaluating embeddings models — was it mostly just vibes on your own repos, or something more rigorous? Asking because I'm working on something similar and based on what you've shipped, I think I could learn a lot from you!


Happy to help!

At the beginning, we started with qualitative "vibe" checks, where we could iterate quickly; the delta in quality was still so significant that we could obviously see what was performing better.

Once we stopped trusting our ability to discern differences, we bit the bullet and built a small eval benchmark set (~20 queries across 3 repos of different sizes), then used it to guide algorithmic development.
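A small eval set like the one described can be scored with something as simple as recall@k: for each query, what fraction of the known-relevant chunks does the retriever return? The harness below is a hedged sketch (the toy retriever and chunk ids are invented for illustration); you would plug in your own retriever.

```python
# eval_set: list of (query, set_of_relevant_chunk_ids).
def recall_at_k(retriever, eval_set, k=5):
    hits = 0
    total = 0
    for query, relevant in eval_set:
        retrieved = set(retriever(query, k))
        hits += len(retrieved & relevant)
        total += len(relevant)
    return hits / total if total else 0.0

# A toy word-overlap retriever, for demonstration only.
CHUNKS = {"c1": "parse config yaml", "c2": "render chart", "c3": "load config"}

def toy_retriever(query, k):
    scored = sorted(
        CHUNKS,
        key=lambda cid: len(set(query.split()) & set(CHUNKS[cid].split())),
        reverse=True,
    )
    return scored[:k]

eval_set = [("parse the config", {"c1", "c3"}), ("draw a chart", {"c2"})]
print(recall_at_k(toy_retriever, eval_set, k=2))
```

Even ~20 hand-labeled queries are enough to turn "this feels better" into a number you can regress against while iterating.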


Thank you, I appreciate the details.


You can certainly apply it to a private repo. If you want to ensure data stays local, you would have to add support for an OSS embedding/LLM model (of which there are many good offerings to pick from).
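The point about keeping data local can be illustrated with an embedder that never makes a network call. The hashing-trick embedder below is a deliberately crude stand-in for a real OSS model (e.g. one served locally); nothing about a private repo's source ever leaves the machine.

```python
# A fully local "embedding": map each token to a bucket via a stable
# hash (the hashing trick). No API calls, no data leaving the machine.
import hashlib

def local_embed(text, dim=64):
    vec = [0.0] * dim
    for token in text.lower().split():
        h = int(hashlib.sha256(token.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    return vec

v = local_embed("private source code stays on this machine")
```

A real local setup would swap this for an open-weights embedding model, but the privacy property (same process, no outbound requests) is identical.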


Please update TL;DR: repo2vec is a simple-to-use, modular library enabling you to chat with any public or private codebase "by sending data to OpenAI."


Thanks for the note! We welcome contributions!


Yup! We use tree-sitter and parse at the file level.
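The project uses tree-sitter for this; the sketch below illustrates the same file-level idea with Python's stdlib `ast` module instead, so it runs without extra dependencies: parse a whole file, then emit one chunk per top-level definition.

```python
# File-level chunking: one chunk per top-level def/class, analogous to
# walking a tree-sitter parse tree's top-level nodes.
import ast

def file_level_chunks(source):
    """Return (name, source_segment) for each top-level def/class."""
    tree = ast.parse(source)
    chunks = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            chunks.append((node.name, ast.get_source_segment(source, node)))
    return chunks

src = "def add(a, b):\n    return a + b\n\nclass Greeter:\n    pass\n"
print([name for name, _ in file_level_chunks(src)])  # expect ['add', 'Greeter']
```

tree-sitter generalizes this beyond Python: the same traversal works over any language with a grammar, which is why it's a common choice for code-chunking pipelines.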


Thanks!

This is a great idea. Definitely something we plan to support.


