A Look at Spring AI's EmbeddingModel
zhezhongyun 2025-04-29 06:49
Preface
This article takes a look at Spring AI's EmbeddingModel.
EmbeddingModel
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingModel.java
public interface EmbeddingModel extends Model<EmbeddingRequest, EmbeddingResponse> {

    @Override
    EmbeddingResponse call(EmbeddingRequest request);

    /**
     * Embeds the given text into a vector.
     * @param text the text to embed.
     * @return the embedded vector.
     */
    default float[] embed(String text) {
        Assert.notNull(text, "Text must not be null");
        List<float[]> response = this.embed(List.of(text));
        return response.iterator().next();
    }

    /**
     * Embeds the given document's content into a vector.
     * @param document the document to embed.
     * @return the embedded vector.
     */
    float[] embed(Document document);

    /**
     * Embeds a batch of texts into vectors.
     * @param texts list of texts to embed.
     * @return list of embedded vectors.
     */
    default List<float[]> embed(List<String> texts) {
        Assert.notNull(texts, "Texts must not be null");
        return this.call(new EmbeddingRequest(texts, EmbeddingOptionsBuilder.builder().build()))
            .getResults()
            .stream()
            .map(Embedding::getOutput)
            .toList();
    }

    /**
     * Embeds a batch of {@link Document}s into vectors based on a
     * {@link BatchingStrategy}.
     * @param documents list of {@link Document}s.
     * @param options {@link EmbeddingOptions}.
     * @param batchingStrategy {@link BatchingStrategy}.
     * @return a list of float[] that represents the vectors for the incoming
     * {@link Document}s. The returned list is expected to be in the same order of the
     * {@link Document} list.
     */
    default List<float[]> embed(List<Document> documents, EmbeddingOptions options, BatchingStrategy batchingStrategy) {
        Assert.notNull(documents, "Documents must not be null");
        List<float[]> embeddings = new ArrayList<>(documents.size());
        List<List<Document>> batch = batchingStrategy.batch(documents);
        for (List<Document> subBatch : batch) {
            List<String> texts = subBatch.stream().map(Document::getText).toList();
            EmbeddingRequest request = new EmbeddingRequest(texts, options);
            EmbeddingResponse response = this.call(request);
            for (int i = 0; i < subBatch.size(); i++) {
                embeddings.add(response.getResults().get(i).getOutput());
            }
        }
        Assert.isTrue(embeddings.size() == documents.size(),
                "Embeddings must have the same number as that of the documents");
        return embeddings;
    }

    /**
     * Embeds a batch of texts into vectors and returns the {@link EmbeddingResponse}.
     * @param texts list of texts to embed.
     * @return the embedding response.
     */
    default EmbeddingResponse embedForResponse(List<String> texts) {
        Assert.notNull(texts, "Texts must not be null");
        return this.call(new EmbeddingRequest(texts, EmbeddingOptionsBuilder.builder().build()));
    }

    /**
     * Get the number of dimensions of the embedded vectors. Note that by default, this
     * method will call the remote Embedding endpoint to get the dimensions of the
     * embedded vectors. If the dimensions are known ahead of time, it is recommended to
     * override this method.
     * @return the number of dimensions of the embedded vectors.
     */
    default int dimensions() {
        return embed("Test String").length;
    }

}
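EmbeddingModel extends the Model interface, with EmbeddingRequest as its request type and EmbeddingResponse as its response type. It declares the abstract methods call and embed(Document), and provides default implementations of the other embed overloads, embedForResponse, and dimensions.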
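The batched embed overload is the most involved of these defaults, so here is a minimal sketch of its loop in plain Java. It is a simplified stand-in, not Spring AI code: BatchedEmbedSketch, fixedSizeBatches and toyCall are hypothetical names, strings replace Document objects, and fixed-size batching is only an assumed substitute for whatever BatchingStrategy is actually passed in.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Simplified stand-in for the batched embed(...) default method; not Spring AI code.
class BatchedEmbedSketch {

    // Hypothetical fixed-size batching, standing in for BatchingStrategy.batch(documents).
    static List<List<String>> fixedSizeBatches(List<String> texts, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<List<String>> batches = new ArrayList<>();
        for (int start = 0; start < texts.size(); start += batchSize) {
            int end = Math.min(start + batchSize, texts.size());
            batches.add(texts.subList(start, end));
        }
        return batches;
    }

    // 'call' stands in for EmbeddingModel.call: one request is issued per sub-batch.
    static List<float[]> embedAll(List<String> texts, int batchSize,
            Function<List<String>, List<float[]>> call) {
        List<float[]> embeddings = new ArrayList<>(texts.size());
        for (List<String> subBatch : fixedSizeBatches(texts, batchSize)) {
            List<float[]> results = call.apply(subBatch);
            // Results are appended in request order, so output order matches input order.
            for (int i = 0; i < subBatch.size(); i++) {
                embeddings.add(results.get(i));
            }
        }
        if (embeddings.size() != texts.size()) {
            throw new IllegalStateException("Embeddings must have the same number as that of the documents");
        }
        return embeddings;
    }

    // Toy "model" for demonstration: a one-dimensional vector holding the text length.
    static List<float[]> toyCall(List<String> batch) {
        List<float[]> out = new ArrayList<>();
        for (String text : batch) {
            out.add(new float[] { text.length() });
        }
        return out;
    }
}
```

Calling BatchedEmbedSketch.embedAll(texts, 2, BatchedEmbedSketch::toyCall) with five texts issues three calls and returns five vectors in input order, which is the ordering guarantee the Javadoc above describes.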
EmbeddingRequest
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingRequest.java
public class EmbeddingRequest implements ModelRequest<List<String>> {

    private final List<String> inputs;

    private final EmbeddingOptions options;

    public EmbeddingRequest(List<String> inputs, EmbeddingOptions options) {
        this.inputs = inputs;
        this.options = options;
    }

    @Override
    public List<String> getInstructions() {
        return this.inputs;
    }

    @Override
    public EmbeddingOptions getOptions() {
        return this.options;
    }

}
EmbeddingRequest implements the ModelRequest interface; its getInstructions method returns a List<String>.
EmbeddingResponse
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingResponse.java
public class EmbeddingResponse implements ModelResponse<Embedding> {

    /**
     * Embedding data.
     */
    private final List<Embedding> embeddings;

    /**
     * Embedding metadata.
     */
    private final EmbeddingResponseMetadata metadata;

    /**
     * Creates a new {@link EmbeddingResponse} instance with empty metadata.
     * @param embeddings the embedding data.
     */
    public EmbeddingResponse(List<Embedding> embeddings) {
        this(embeddings, new EmbeddingResponseMetadata());
    }

    /**
     * Creates a new {@link EmbeddingResponse} instance.
     * @param embeddings the embedding data.
     * @param metadata the embedding metadata.
     */
    public EmbeddingResponse(List<Embedding> embeddings, EmbeddingResponseMetadata metadata) {
        this.embeddings = embeddings;
        this.metadata = metadata;
    }

    /**
     * @return Get the embedding metadata.
     */
    public EmbeddingResponseMetadata getMetadata() {
        return this.metadata;
    }

    @Override
    public Embedding getResult() {
        Assert.notEmpty(this.embeddings, "No embedding data available.");
        return this.embeddings.get(0);
    }

    /**
     * @return Get the embedding data.
     */
    @Override
    public List<Embedding> getResults() {
        return this.embeddings;
    }

    //......

}
EmbeddingResponse implements the ModelResponse interface; its result type is Embedding.
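The difference between getResult() and getResults() is easy to miss, so the following sketch models just those two accessors. It is a simplified stand-in rather than the real class: ResponseAccessSketch is a hypothetical name and plain float[] vectors replace Embedding objects. It relies on Spring's Assert.notEmpty throwing IllegalArgumentException for a null or empty collection.

```java
import java.util.List;

// Simplified stand-in for EmbeddingResponse's accessors; not Spring AI code.
class ResponseAccessSketch {

    private final List<float[]> embeddings;

    ResponseAccessSketch(List<float[]> embeddings) {
        this.embeddings = embeddings;
    }

    // Mirrors getResult(): fail fast on an empty list, otherwise return the first entry.
    float[] getResult() {
        if (this.embeddings == null || this.embeddings.isEmpty()) {
            throw new IllegalArgumentException("No embedding data available.");
        }
        return this.embeddings.get(0);
    }

    // Mirrors getResults(): hand back the whole list, with no emptiness check.
    List<float[]> getResults() {
        return this.embeddings;
    }
}
```

In other words, getResult() is a convenience for single-input requests, while multi-input requests should iterate over getResults().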
AbstractEmbeddingModel
spring-ai-core/src/main/java/org/springframework/ai/embedding/AbstractEmbeddingModel.java
public abstract class AbstractEmbeddingModel implements EmbeddingModel {

    private static final Map<String, Integer> KNOWN_EMBEDDING_DIMENSIONS = loadKnownModelDimensions();

    /**
     * Default constructor.
     */
    public AbstractEmbeddingModel() {
    }

    /**
     * Cached embedding dimensions.
     */
    protected final AtomicInteger embeddingDimensions = new AtomicInteger(-1);

    /**
     * Return the dimension of the requested embedding generative name. If the generative
     * name is unknown uses the EmbeddingModel to perform a dummy EmbeddingModel#embed and
     * count the response dimensions.
     * @param embeddingModel Fall-back client to determine, empirically the dimensions.
     * @param modelName Embedding generative name to retrieve the dimensions for.
     * @param dummyContent Dummy content to use for the empirical dimension calculation.
     * @return Returns the embedding dimensions for the modelName.
     */
    public static int dimensions(EmbeddingModel embeddingModel, String modelName, String dummyContent) {
        if (KNOWN_EMBEDDING_DIMENSIONS.containsKey(modelName)) {
            // Retrieve the dimension from a pre-configured file.
            return KNOWN_EMBEDDING_DIMENSIONS.get(modelName);
        }
        else {
            // Determine the dimensions empirically.
            // Generate an embedding and count the dimension size;
            return embeddingModel.embed(dummyContent).length;
        }
    }

    private static Map<String, Integer> loadKnownModelDimensions() {
        try {
            Properties properties = new Properties();
            properties.load(new DefaultResourceLoader()
                .getResource("classpath:/embedding/embedding-model-dimensions.properties")
                .getInputStream());
            return properties.entrySet()
                .stream()
                .collect(Collectors.toMap(e -> e.getKey().toString(), e -> Integer.parseInt(e.getValue().toString())));
        }
        catch (IOException e) {
            throw new RuntimeException(e);
        }
    }

    @Override
    public int dimensions() {
        if (this.embeddingDimensions.get() < 0) {
            this.embeddingDimensions.set(dimensions(this, "Test", "Hello World"));
        }
        return this.embeddingDimensions.get();
    }

}
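AbstractEmbeddingModel implements the dimensions method defined by the EmbeddingModel interface and caches the result in embeddingDimensions. Different modules provide their own subclasses, for example OpenAiEmbeddingModel in spring-ai-openai, OllamaEmbeddingModel in spring-ai-ollama, and MiniMaxEmbeddingModel in spring-ai-minimax.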
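The lookup-then-cache behaviour can be sketched in plain Java as follows. This is a simplified stand-in, not Spring AI code: DimensionLookupSketch and its members are hypothetical names, the single map entry is only an illustrative value (the real table is loaded from embedding-model-dimensions.properties), and the model name is taken from the constructor here, whereas the code above always passes "Test".

```java
import java.util.Map;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

// Simplified stand-in for AbstractEmbeddingModel's dimension lookup; not Spring AI code.
class DimensionLookupSketch {

    // Illustrative entry only; the real values come from a properties file on the classpath.
    static final Map<String, Integer> KNOWN_DIMENSIONS = Map.of("text-embedding-ada-002", 1536);

    private final AtomicInteger cachedDimensions = new AtomicInteger(-1);
    private final AtomicInteger embedCalls = new AtomicInteger();
    private final String modelName;
    private final Function<String, float[]> embedder;

    DimensionLookupSketch(String modelName, Function<String, float[]> embedder) {
        this.modelName = modelName;
        this.embedder = embedder;
    }

    // Known model: read the table. Unknown model: embed dummy content and measure the vector.
    static int dimensions(Function<String, float[]> embedder, String modelName, String dummyContent) {
        Integer known = KNOWN_DIMENSIONS.get(modelName);
        if (known != null) {
            return known;
        }
        return embedder.apply(dummyContent).length;
    }

    // The first call resolves the value; later calls return the cached number.
    int dimensions() {
        if (this.cachedDimensions.get() < 0) {
            this.cachedDimensions.set(dimensions(this::countingEmbed, this.modelName, "Hello World"));
        }
        return this.cachedDimensions.get();
    }

    // Wraps the embedder so the sketch can show how often the fallback actually runs.
    private float[] countingEmbed(String text) {
        this.embedCalls.incrementAndGet();
        return this.embedder.apply(text);
    }

    int embedCalls() {
        return this.embedCalls.get();
    }
}
```

With an unknown model name, calling dimensions() twice triggers only one embedding request, which is the point of the AtomicInteger cache.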
OllamaEmbeddingAutoConfiguration
org/springframework/ai/model/ollama/autoconfigure/OllamaEmbeddingAutoConfiguration.java
@AutoConfiguration(after = RestClientAutoConfiguration.class)
@ConditionalOnClass(OllamaEmbeddingModel.class)
@ConditionalOnProperty(name = SpringAIModelProperties.EMBEDDING_MODEL, havingValue = SpringAIModels.OLLAMA,
        matchIfMissing = true)
@EnableConfigurationProperties({ OllamaEmbeddingProperties.class, OllamaInitializationProperties.class })
@ImportAutoConfiguration(classes = { OllamaApiAutoConfiguration.class, RestClientAutoConfiguration.class,
        WebClientAutoConfiguration.class })
public class OllamaEmbeddingAutoConfiguration {

    @Bean
    @ConditionalOnMissingBean
    public OllamaEmbeddingModel ollamaEmbeddingModel(OllamaApi ollamaApi, OllamaEmbeddingProperties properties,
            OllamaInitializationProperties initProperties, ObjectProvider<ObservationRegistry> observationRegistry,
            ObjectProvider<EmbeddingModelObservationConvention> observationConvention) {
        var embeddingModelPullStrategy = initProperties.getEmbedding().isInclude()
                ? initProperties.getPullModelStrategy() : PullModelStrategy.NEVER;

        var embeddingModel = OllamaEmbeddingModel.builder()
            .ollamaApi(ollamaApi)
            .defaultOptions(properties.getOptions())
            .observationRegistry(observationRegistry.getIfUnique(() -> ObservationRegistry.NOOP))
            .modelManagementOptions(new ModelManagementOptions(embeddingModelPullStrategy,
                    initProperties.getEmbedding().getAdditionalModels(), initProperties.getTimeout(),
                    initProperties.getMaxRetries()))
            .build();

        observationConvention.ifAvailable(embeddingModel::setObservationConvention);

        return embeddingModel;
    }

}
OllamaEmbeddingAutoConfiguration is enabled when spring.ai.model.embedding is set to ollama (or is not set at all, since matchIfMissing = true); it auto-configures an OllamaEmbeddingModel bean.
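The property condition can be modelled in a few lines of plain Java. This is a simplified sketch of the matching rule, not Spring Boot's implementation: EmbeddingConditionSketch is a hypothetical name, a Map stands in for the Spring Environment, and the case-insensitive comparison is an assumption about how havingValue is matched.

```java
import java.util.Map;

// Simplified model of @ConditionalOnProperty(havingValue = "ollama", matchIfMissing = true).
class EmbeddingConditionSketch {

    static final String EMBEDDING_MODEL_PROPERTY = "spring.ai.model.embedding";

    static boolean ollamaEmbeddingEnabled(Map<String, String> environment) {
        String value = environment.get(EMBEDDING_MODEL_PROPERTY);
        if (value == null) {
            // matchIfMissing = true: an absent property still activates the configuration.
            return true;
        }
        // Assumed case-insensitive match against the expected value.
        return "ollama".equalsIgnoreCase(value);
    }
}
```

One consequence worth noting: with several model starters on the classpath and the property unset, this condition alone does not switch the Ollama embedding configuration off; setting spring.ai.model.embedding to another value does.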
OllamaEmbeddingProperties
org/springframework/ai/model/ollama/autoconfigure/OllamaEmbeddingProperties.java
@ConfigurationProperties(OllamaEmbeddingProperties.CONFIG_PREFIX)
public class OllamaEmbeddingProperties {

    public static final String CONFIG_PREFIX = "spring.ai.ollama.embedding";

    /**
     * Client lever Ollama options. Use this property to configure generative temperature,
     * topK and topP and alike parameters. The null values are ignored defaulting to the
     * generative's defaults.
     */
    @NestedConfigurationProperty
    private OllamaOptions options = OllamaOptions.builder().model(OllamaModel.MXBAI_EMBED_LARGE.id()).build();

    public String getModel() {
        return this.options.getModel();
    }

    public void setModel(String model) {
        this.options.setModel(model);
    }

    public OllamaOptions getOptions() {
        return this.options;
    }

}
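OllamaEmbeddingProperties mainly exposes the OllamaOptions configuration, with OllamaModel.MXBAI_EMBED_LARGE as the default model. For the individual options, see
https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md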
OllamaInitializationProperties
org/springframework/ai/model/ollama/autoconfigure/OllamaInitializationProperties.java
@ConfigurationProperties(OllamaInitializationProperties.CONFIG_PREFIX)
public class OllamaInitializationProperties {

    public static final String CONFIG_PREFIX = "spring.ai.ollama.init";

    /**
     * Chat models initialization settings.
     */
    private final ModelTypeInit chat = new ModelTypeInit();

    /**
     * Embedding models initialization settings.
     */
    private final ModelTypeInit embedding = new ModelTypeInit();

    /**
     * Whether to pull models at startup-time and how.
     */
    private PullModelStrategy pullModelStrategy = PullModelStrategy.NEVER;

    /**
     * How long to wait for a model to be pulled.
     */
    private Duration timeout = Duration.ofMinutes(5);

    /**
     * Maximum number of retries for the model pull operation.
     */
    private int maxRetries = 0;

    public PullModelStrategy getPullModelStrategy() {
        return this.pullModelStrategy;
    }

    public void setPullModelStrategy(PullModelStrategy pullModelStrategy) {
        this.pullModelStrategy = pullModelStrategy;
    }

    public ModelTypeInit getChat() {
        return this.chat;
    }

    public ModelTypeInit getEmbedding() {
        return this.embedding;
    }

    public Duration getTimeout() {
        return this.timeout;
    }

    public void setTimeout(Duration timeout) {
        this.timeout = timeout;
    }

    public int getMaxRetries() {
        return this.maxRetries;
    }

    public void setMaxRetries(int maxRetries) {
        this.maxRetries = maxRetries;
    }

    public static class ModelTypeInit {

        /**
         * Include this type of models in the initialization task.
         */
        private boolean include = true;

        /**
         * Additional models to initialize besides the ones configured via default
         * properties.
         */
        private List<String> additionalModels = List.of();

        public boolean isInclude() {
            return this.include;
        }

        public void setInclude(boolean include) {
            this.include = include;
        }

        public List<String> getAdditionalModels() {
            return this.additionalModels;
        }

        public void setAdditionalModels(List<String> additionalModels) {
            this.additionalModels = additionalModels;
        }

    }

}
OllamaInitializationProperties binds the spring.ai.ollama.init prefix, i.e. the Ollama initialization settings; its ModelTypeInit lets you specify which additional models to initialize.
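How these settings combine into the pull strategy that the auto-configuration hands to the embedding model can be sketched as below. PullStrategySketch is a hypothetical stand-in, and the enum constant names are assumed to mirror Spring AI's PullModelStrategy; only the include-or-NEVER rule is taken from the auto-configuration code shown earlier.

```java
// Simplified stand-in for the pull-strategy resolution in the auto-configuration.
class PullStrategySketch {

    // Constant names assumed to mirror Spring AI's PullModelStrategy enum.
    enum PullModelStrategy { ALWAYS, WHEN_MISSING, NEVER }

    // include = false forces NEVER, whatever spring.ai.ollama.init.pull-model-strategy says.
    static PullModelStrategy effective(boolean include, PullModelStrategy configured) {
        return include ? configured : PullModelStrategy.NEVER;
    }
}
```

Since pullModelStrategy itself defaults to NEVER, no model is pulled at startup unless the strategy is changed explicitly, even though include defaults to true.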
Example
pom.xml
<dependency>
<groupId>org.springframework.ai</groupId>
<artifactId>spring-ai-starter-model-ollama</artifactId>
</dependency>
Configuration
spring:
  ai:
    model:
      embedding: ollama
    ollama:
      init:
        timeout: 5m
        max-retries: 0
        embedding:
          include: true
          additional-models: []
      base-url: http://localhost:11434
      embedding:
        enabled: true
        options:
          model: bge-m3:latest
          truncate: true
example
@Test
public void testCall() {
    EmbeddingRequest request = new EmbeddingRequest(List.of("Hello World", "World is big and salvation is near"),
            OllamaOptions.builder()
                .model("bge-m3:latest")
                .truncate(false)
                .build());
    EmbeddingResponse embeddingResponse = embeddingModel.call(request);
    log.info("resp:{}", JSON.toJSONString(embeddingResponse));
}
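Once the call returns, each Embedding's output is a plain float[], so the two vectors from the request above can be compared directly. The sketch below shows one common way to do that, cosine similarity; CosineSimilaritySketch is a hypothetical helper and not part of Spring AI or of the original test.

```java
// Hypothetical helper for comparing two embedding vectors; not part of Spring AI.
class CosineSimilaritySketch {

    static double cosine(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("Vectors must have the same number of dimensions");
        }
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            throw new IllegalArgumentException("Cosine similarity is undefined for a zero vector");
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```

In the test above this could be applied to embeddingResponse.getResults().get(0).getOutput() and embeddingResponse.getResults().get(1).getOutput(); values closer to 1 indicate more similar texts.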
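Summary
Spring AI defines the EmbeddingModel interface, which extends Model with EmbeddingRequest as its request type and EmbeddingResponse as its response type. It declares call and embed(Document), and provides default implementations of the other embed overloads, embedForResponse, and dimensions. AbstractEmbeddingModel implements the dimensions method defined by EmbeddingModel, and different modules provide their own subclasses, for example OpenAiEmbeddingModel in spring-ai-openai, OllamaEmbeddingModel in spring-ai-ollama, and MiniMaxEmbeddingModel in spring-ai-minimax.
OllamaEmbeddingAutoConfiguration is enabled when spring.ai.model.embedding is set to ollama (or is not set), and it auto-configures OllamaEmbeddingModel.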
doc
- embeddings
- ollama-embeddings