[Task]: Vector Embedding Model Pipeline #42

Open
lukasnxyz opened this issue Mar 12, 2025 · 1 comment
@lukasnxyz
Contributor

lukasnxyz commented Mar 12, 2025

Task Description

We need a pipeline that converts data into vectors via an OpenAI embedding model and integrates with HelixDB. It should behave differently when running locally vs. on managed hosting. It should also offer an embedding model if it's run locally.
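One way to picture the local vs. managed split is a single embedder interface with the backend chosen by deployment mode. This is only a rough sketch under assumptions: `Embedder`, `select_embedder`, the mode strings, and `local_placeholder_embedder` are hypothetical names, not existing HelixDB code, and the hash-based embedder is merely a stand-in for whatever real local model ends up being offered.

```python
import hashlib
import math
from typing import Callable, List

# Hypothetical interface: an embedder maps a batch of texts to a batch of vectors.
Embedder = Callable[[List[str]], List[List[float]]]


def local_placeholder_embedder(dim: int = 8) -> Embedder:
    """Stand-in for a real local model (assumption, not a real model).

    Produces deterministic, L2-normalized vectors from a SHA-256 digest so the
    rest of the pipeline can be exercised without network access.
    """
    def embed(texts: List[str]) -> List[List[float]]:
        vectors = []
        for text in texts:
            digest = hashlib.sha256(text.encode("utf-8")).digest()
            # Map bytes to values in [-0.5, 0.5]; never exactly zero.
            raw = [digest[i % len(digest)] / 255.0 - 0.5 for i in range(dim)]
            norm = math.sqrt(sum(x * x for x in raw))
            vectors.append([x / norm for x in raw])
        return vectors
    return embed


def select_embedder(mode: str, managed: Embedder, local: Embedder) -> Embedder:
    """Pick the backend for the given deployment mode ("managed" or "local")."""
    if mode == "managed":
        return managed
    if mode == "local":
        return local
    raise ValueError(f"unknown deployment mode: {mode!r}")
```

Keeping the choice behind one function means the code that writes vectors into the database does not need to know which backend produced them.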

Priority

Medium

@lukasnxyz
Contributor Author

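OpenAI embedding models:

  • small (text-embedding-3-small): 1536 dimensions
  • large (text-embedding-3-large): 3072 dimensions
  • dimension reduction is also possible via the API (the `dimensions` request parameter), so we might have to integrate that with the current version of dimension reduction in the vector_optims branch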

https://platform.openai.com/docs/guides/embeddings?lang=python#embedding-models
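A minimal sketch of what the API-side reduction could look like, assuming plain HTTP against the `/v1/embeddings` endpoint rather than any particular client library. The helper names (`build_request_body`, `fetch_embeddings`, `truncate_and_normalize`) are hypothetical. The last helper shows the client-side alternative of truncating a full-size vector and re-normalizing it. How either path should interact with the vector_optims branch is not settled here.

```python
import json
import math
import urllib.request
from typing import List, Optional

OPENAI_EMBEDDINGS_URL = "https://api.openai.com/v1/embeddings"


def build_request_body(texts: List[str],
                       model: str = "text-embedding-3-small",
                       dimensions: Optional[int] = None) -> dict:
    """Build the JSON body; `dimensions` asks the API for shorter vectors."""
    body = {"model": model, "input": texts}
    if dimensions is not None:
        body["dimensions"] = dimensions
    return body


def fetch_embeddings(texts: List[str], api_key: str,
                     model: str = "text-embedding-3-small",
                     dimensions: Optional[int] = None) -> List[List[float]]:
    """POST the request and return one vector per input, in input order."""
    body = build_request_body(texts, model, dimensions)
    request = urllib.request.Request(
        OPENAI_EMBEDDINGS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Each item carries the index of the input it belongs to.
    items = sorted(payload["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def truncate_and_normalize(vector: List[float], dim: int) -> List[float]:
    """Client-side reduction: keep the first `dim` values, then L2-normalize."""
    if not 0 < dim <= len(vector):
        raise ValueError("dim must be between 1 and len(vector)")
    cut = vector[:dim]
    norm = math.sqrt(sum(x * x for x in cut))
    return cut if norm == 0 else [x / norm for x in cut]
```

Requesting `dimensions` up front avoids transferring full-size vectors, while truncating locally lets one stored embedding serve several target sizes.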

@lukasnxyz lukasnxyz self-assigned this Mar 22, 2025