Generate text embeddings for RAG (Retrieval-Augmented Generation), semantic search, and similarity comparisons using the Llama provider.
Generate embeddings for multiple texts:
Configure the embedding model with specific options:
| Option | Type | Default | Description |
|---|---|---|---|
normalize | number | -1 | Embedding normalization mode |
contextParams.n_ctx | number | 2048 | Context window size |
contextParams.n_gpu_layers | number | 99 | GPU layers for acceleration |
contextParams.n_parallel | number | 8 | Number of parallel embeddings |
Release resources when done: