聊聊Spring AI的EmbeddingModel
zhezhongyun 2025-04-29 06:49 30 浏览
序
本文主要研究一下Spring AI的EmbeddingModel
EmbeddingModel
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingModel.java
public interface EmbeddingModel extends Model<EmbeddingRequest, EmbeddingResponse> {
@Override
EmbeddingResponse call(EmbeddingRequest request);
/**
* Embeds the given text into a vector.
* @param text the text to embed.
* @return the embedded vector.
*/
default float[] embed(String text) {
Assert.notNull(text, "Text must not be null");
List<float[]> response = this.embed(List.of(text));
return response.iterator().next();
}
/**
* Embeds the given document's content into a vector.
* @param document the document to embed.
* @return the embedded vector.
*/
float[] embed(Document document);
/**
* Embeds a batch of texts into vectors.
* @param texts list of texts to embed.
* @return list of embedded vectors.
*/
default List<float[]> embed(List<String> texts) {
Assert.notNull(texts, "Texts must not be null");
return this.call(new EmbeddingRequest(texts, EmbeddingOptionsBuilder.builder().build()))
.getResults()
.stream()
.map(Embedding::getOutput)
.toList();
}
/**
* Embeds a batch of {@link Document}s into vectors based on a
* {@link BatchingStrategy}.
* @param documents list of {@link Document}s.
* @param options {@link EmbeddingOptions}.
* @param batchingStrategy {@link BatchingStrategy}.
* @return a list of float[] that represents the vectors for the incoming
* {@link Document}s. The returned list is expected to be in the same order of the
* {@link Document} list.
*/
default List<float[]> embed(List<Document> documents, EmbeddingOptions options, BatchingStrategy batchingStrategy) {
Assert.notNull(documents, "Documents must not be null");
List<float[]> embeddings = new ArrayList<>(documents.size());
List<List<Document>> batch = batchingStrategy.batch(documents);
for (List<Document> subBatch : batch) {
List<String> texts = subBatch.stream().map(Document::getText).toList();
EmbeddingRequest request = new EmbeddingRequest(texts, options);
EmbeddingResponse response = this.call(request);
for (int i = 0; i < subBatch.size(); i++) {
embeddings.add(response.getResults().get(i).getOutput());
}
}
Assert.isTrue(embeddings.size() == documents.size(),
"Embeddings must have the same number as that of the documents");
return embeddings;
}
/**
* Embeds a batch of texts into vectors and returns the {@link EmbeddingResponse}.
* @param texts list of texts to embed.
* @return the embedding response.
*/
default EmbeddingResponse embedForResponse(List<String> texts) {
Assert.notNull(texts, "Texts must not be null");
return this.call(new EmbeddingRequest(texts, EmbeddingOptionsBuilder.builder().build()));
}
/**
* Get the number of dimensions of the embedded vectors. Note that by default, this
* method will call the remote Embedding endpoint to get the dimensions of the
* embedded vectors. If the dimensions are known ahead of time, it is recommended to
* override this method.
* @return the number of dimensions of the embedded vectors.
*/
default int dimensions() {
return embed("Test String").length;
}
}
EmbeddingModel继承了Model接口,其入参类型为EmbeddingRequest,返回类型为EmbeddingResponse,它定义了call、embed接口,提供了embed、embedForResponse、dimensions的默认实现
EmbeddingRequest
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingRequest.java
public class EmbeddingRequest implements ModelRequest<List<String>> {
private final List<String> inputs;
private final EmbeddingOptions options;
public EmbeddingRequest(List<String> inputs, EmbeddingOptions options) {
this.inputs = inputs;
this.options = options;
}
@Override
public List<String> getInstructions() {
return this.inputs;
}
@Override
public EmbeddingOptions getOptions() {
return this.options;
}
}
EmbeddingRequest实现了ModelRequest接口,其getInstructions返回的是List<String>
EmbeddingResponse
spring-ai-core/src/main/java/org/springframework/ai/embedding/EmbeddingResponse.java
public class EmbeddingResponse implements ModelResponse<Embedding> {
/**
* Embedding data.
*/
private final List<Embedding> embeddings;
/**
* Embedding metadata.
*/
private final EmbeddingResponseMetadata metadata;
/**
* Creates a new {@link EmbeddingResponse} instance with empty metadata.
* @param embeddings the embedding data.
*/
public EmbeddingResponse(List<Embedding> embeddings) {
this(embeddings, new EmbeddingResponseMetadata());
}
/**
* Creates a new {@link EmbeddingResponse} instance.
* @param embeddings the embedding data.
* @param metadata the embedding metadata.
*/
public EmbeddingResponse(List<Embedding> embeddings, EmbeddingResponseMetadata metadata) {
this.embeddings = embeddings;
this.metadata = metadata;
}
/**
* @return Get the embedding metadata.
*/
public EmbeddingResponseMetadata getMetadata() {
return this.metadata;
}
@Override
public Embedding getResult() {
Assert.notEmpty(this.embeddings, "No embedding data available.");
return this.embeddings.get(0);
}
/**
* @return Get the embedding data.
*/
@Override
public List<Embedding> getResults() {
return this.embeddings;
}
//......
}
EmbeddingResponse实现了ModelResponse接口,其result为Embedding类型
AbstractEmbeddingModel
spring-ai-core/src/main/java/org/springframework/ai/embedding/AbstractEmbeddingModel.java
public abstract class AbstractEmbeddingModel implements EmbeddingModel {
private static final Map<String, Integer> KNOWN_EMBEDDING_DIMENSIONS = loadKnownModelDimensions();
/**
* Default constructor.
*/
public AbstractEmbeddingModel() {
}
/**
* Cached embedding dimensions.
*/
protected final AtomicInteger embeddingDimensions = new AtomicInteger(-1);
/**
* Return the dimension of the requested embedding generative name. If the generative
* name is unknown uses the EmbeddingModel to perform a dummy EmbeddingModel#embed and
* count the response dimensions.
* @param embeddingModel Fall-back client to determine, empirically the dimensions.
* @param modelName Embedding generative name to retrieve the dimensions for.
* @param dummyContent Dummy content to use for the empirical dimension calculation.
* @return Returns the embedding dimensions for the modelName.
*/
public static int dimensions(EmbeddingModel embeddingModel, String modelName, String dummyContent) {
if (KNOWN_EMBEDDING_DIMENSIONS.containsKey(modelName)) {
// Retrieve the dimension from a pre-configured file.
return KNOWN_EMBEDDING_DIMENSIONS.get(modelName);
}
else {
// Determine the dimensions empirically.
// Generate an embedding and count the dimension size;
return embeddingModel.embed(dummyContent).length;
}
}
private static Map<String, Integer> loadKnownModelDimensions() {
try {
Properties properties = new Properties();
properties.load(new DefaultResourceLoader()
.getResource("classpath:/embedding/embedding-model-dimensions.properties")
.getInputStream());
return properties.entrySet()
.stream()
.collect(Collectors.toMap(e -> e.getKey().toString(), e -> Integer.parseInt(e.getValue().toString())));
}
catch (IOException e) {
throw new RuntimeException(e);
}
}
@Override
public int dimensions() {
if (this.embeddingDimensions.get() < 0) {
this.embeddingDimensions.set(dimensions(this, "Test", "Hello World"));
}
return this.embeddingDimensions.get();
}
}
AbstractEmbeddingModel实现了EmbeddingModel接口定义的dimensions方法,它在不同模块有不同的实现子类,比如spring-ai-openai的OpenAiEmbeddingModel、spring-ai-ollama的OllamaEmbeddingModel、spring-ai-minimax的MiniMaxEmbeddingModel等
OllamaEmbeddingAutoConfiguration
org/springframework/ai/model/ollama/autoconfigure/OllamaEmbeddingAutoConfiguration.java
@AutoConfiguration(after = RestClientAutoConfiguration.class)
@ConditionalOnClass(OllamaEmbeddingModel.class)
@ConditionalOnProperty(name = SpringAIModelProperties.EMBEDDING_MODEL, havingValue = SpringAIModels.OLLAMA,
matchIfMissing = true)
@EnableConfigurationProperties({ OllamaEmbeddingProperties.class, OllamaInitializationProperties.class })
@ImportAutoConfiguration(classes = { OllamaApiAutoConfiguration.class, RestClientAutoConfiguration.class,
WebClientAutoConfiguration.class })
public class OllamaEmbeddingAutoConfiguration {
@Bean
@ConditionalOnMissingBean
public OllamaEmbeddingModel ollamaEmbeddingModel(OllamaApi ollamaApi, OllamaEmbeddingProperties properties,
OllamaInitializationProperties initProperties, ObjectProvider<ObservationRegistry> observationRegistry,
ObjectProvider<EmbeddingModelObservationConvention> observationConvention) {
var embeddingModelPullStrategy = initProperties.getEmbedding().isInclude()
? initProperties.getPullModelStrategy() : PullModelStrategy.NEVER;
var embeddingModel = OllamaEmbeddingModel.builder()
.ollamaApi(ollamaApi)
.defaultOptions(properties.getOptions())
.observationRegistry(observationRegistry.getIfUnique(() -> ObservationRegistry.NOOP))
.modelManagementOptions(new ModelManagementOptions(embeddingModelPullStrategy,
initProperties.getEmbedding().getAdditionalModels(), initProperties.getTimeout(),
initProperties.getMaxRetries()))
.build();
observationConvention.ifAvailable(embeddingModel::setObservationConvention);
return embeddingModel;
}
}
OllamaEmbeddingAutoConfiguration在spring.ai.model.embedding为ollama时启用,它自动配置了OllamaEmbeddingModel
OllamaEmbeddingProperties
org/springframework/ai/model/ollama/autoconfigure/OllamaEmbeddingProperties.java
@ConfigurationProperties(OllamaEmbeddingProperties.CONFIG_PREFIX)
public class OllamaEmbeddingProperties {
public static final String CONFIG_PREFIX = "spring.ai.ollama.embedding";
/**
* Client lever Ollama options. Use this property to configure generative temperature,
* topK and topP and alike parameters. The null values are ignored defaulting to the
* generative's defaults.
*/
@NestedConfigurationProperty
private OllamaOptions options = OllamaOptions.builder().model(OllamaModel.MXBAI_EMBED_LARGE.id()).build();
public String getModel() {
return this.options.getModel();
}
public void setModel(String model) {
this.options.setModel(model);
}
public OllamaOptions getOptions() {
return this.options;
}
}
OllamaEmbeddingProperties主要是提供了OllamaOptions属性配置,具体可以参考
https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md
OllamaInitializationProperties
org/springframework/ai/model/ollama/autoconfigure/OllamaInitializationProperties.java
@ConfigurationProperties(OllamaInitializationProperties.CONFIG_PREFIX)
public class OllamaInitializationProperties {
public static final String CONFIG_PREFIX = "spring.ai.ollama.init";
/**
* Chat models initialization settings.
*/
private final ModelTypeInit chat = new ModelTypeInit();
/**
* Embedding models initialization settings.
*/
private final ModelTypeInit embedding = new ModelTypeInit();
/**
* Whether to pull models at startup-time and how.
*/
private PullModelStrategy pullModelStrategy = PullModelStrategy.NEVER;
/**
* How long to wait for a model to be pulled.
*/
private Duration timeout = Duration.ofMinutes(5);
/**
* Maximum number of retries for the model pull operation.
*/
private int maxRetries = 0;
public PullModelStrategy getPullModelStrategy() {
return this.pullModelStrategy;
}
public void setPullModelStrategy(PullModelStrategy pullModelStrategy) {
this.pullModelStrategy = pullModelStrategy;
}
public ModelTypeInit getChat() {
return this.chat;
}
public ModelTypeInit getEmbedding() {
return this.embedding;
}
public Duration getTimeout() {
return this.timeout;
}
public void setTimeout(Duration timeout) {
this.timeout = timeout;
}
public int getMaxRetries() {
return this.maxRetries;
}
public void setMaxRetries(int maxRetries) {
this.maxRetries = maxRetries;
}
public static class ModelTypeInit {
/**
* Include this type of models in the initialization task.
*/
private boolean include = true;
/**
* Additional models to initialize besides the ones configured via default
* properties.
*/
private List<String> additionalModels = List.of();
public boolean isInclude() {
return this.include;
}
public void setInclude(boolean include) {
this.include = include;
}
public List<String> getAdditionalModels() {
return this.additionalModels;
}
public void setAdditionalModels(List<String> additionalModels) {
this.additionalModels = additionalModels;
}
}
}
OllamaInitializationProperties提供了spring.ai.ollama.init即ollama初始化的相关配置,其中ModelTypeInit可以指定初始化哪些额外的model
示例
pom.xml
<dependency>
<groupId>org.springframework.ai</groupId>
<artifactId>spring-ai-starter-model-ollama</artifactId>
</dependency>
配置
spring:
ai:
model:
embedding: ollama
ollama:
init:
timeout: 5m
max-retries: 0
embedding:
include: true
additional-models: []
base-url: http://localhost:11434
embedding:
enabled: true
options:
model: bge-m3:latest
truncate: true
example
@Test
public void testCall() {
EmbeddingRequest request = new EmbeddingRequest(List.of("Hello World", "World is big and salvation is near"),
OllamaOptions.builder()
.model("bge-m3:latest")
.truncate(false)
.build());
EmbeddingResponse embeddingResponse = embeddingModel.call(request);
log.info("resp:{}", JSON.toJSONString(embeddingResponse));
}
小结
Spring AI定义了EmbeddingModel接口,它继承了Model接口,其入参类型为EmbeddingRequest,返回类型为EmbeddingResponse,它定义了call、embed接口,提供了embed、embedForResponse、dimensions的默认实现;AbstractEmbeddingModel实现了EmbeddingModel接口定义的dimensions方法,它在不同模块有不同的实现子类,比如spring-ai-openai的OpenAiEmbeddingModel、spring-ai-ollama的OllamaEmbeddingModel、spring-ai-minimax的MiniMaxEmbeddingModel等;
OllamaEmbeddingAutoConfiguration在spring.ai.model.embedding为ollama时启用,它自动配置了OllamaEmbeddingModel。
doc
- embeddings
- ollama-embeddings
相关推荐
- Win10全新版本速览 全新图标设计焕然一新
-
来源:太平洋电脑网[PConline资讯]原定于今年下半年发布的Win1021H2(SunValley),再有几个月就要与我们见面了。之前我们已经陆续介绍过新版在开始菜单、窗口动效、通知中心等方...
- 第四周B组最佳选手Icon:陨落天团的小将,还是重生星球的偶像?
-
B组本来被认为是死亡之组,但在上周的异组对抗赛中,实力较强的几只队伍却先后被A组击沉,反而是之前一分难求的OMG2:1战胜了IG,其中Icon功不可没,两次秀翻全场的妖姬和与Juejue中野联动的实力...
- 系统小技巧:不花一分钱 让声音再大些
-
有时,笔记本扬声器的音量可能无法满足我们对声音播放的需求。点击系统托盘的小喇叭图标,音量调节滑动条调整到最大也无效,而添置大功率扬声器又不是我们所愿。这时,可通过调整系统本身的设置或利用第三方软件来解...
- DNF手游:70级毕业搭配曝光!需集齐四类装备,别指望无形升级了
-
随着dnf手游7.16版本的临近,各种70级装备的消息满天飞,真真假假难以辨别,但只要以“发布会爆料”为核心,主播十四的爆料为辅,就能确定70版本的毕业标准搭配,查理策划是打算“集百家之长”于一身,完...
- 学会这5个电脑设置 可解决99%的故障
-
现阶段复工大家都是宅家办公吧,所以电脑可谓是大救星,因为无论是“停课不停学”的网课,还是在家办公的穷社畜,都离不开电脑。所以今天小编就来教大家几招电脑的自救方法,让你可以在电脑出问题时从容面对,足不出...
- LOL:上单腕豪的取胜之道——掌控自身优势,融合装备特效
-
01前言腕豪这个英雄,在11.9版本之前都没有太多的出场机会,即使出现也大多是在辅助位置上,充当开团、先手控制的角色。但是在11.9版本以后,这个英雄突然之间仿佛飞升了一般,直接冲上T1级别上单,之前...
- 英雄联盟手游:一篇文章读懂所有龙buff属性效果,还不赶紧收藏
-
英雄联盟手游采用也是经典峡谷地图,看起来和端游并没有太大差异,但实际上为了适应手游节奏,官方也是进行了适当调整,所以包括红蓝buff,大龙小龙buff效果会有一定的差异,现在就来看看小龙和大龙具体属性...
- 双击打不开怎么办?双击文件夹显示属性的解决办法
-
今天小编在双击文件夹的时候,就是打不开,还弹出“属性”对话框,不知道大家有没有出现过,为避免出现这种情况时大家束手无策,今天小编就来为大家分享一下解决方法。1.检查键盘,看看“Alt”键是否卡住。由于...
- 柜子布局好超省空间,1㎡当10㎡用,收纳涨5倍!户型图直接抄
-
柜子不嫌多,就怕你家放不下!▼每个空间几乎都需要柜子客厅要有电视柜,卧室要能放下全家四季的衣柜,厨房还需要大容量的橱柜空间就这么小,想要榨出放柜子的地方,还要活得不拥挤简直太难了!其实全是你找错地方打...
- 制作幻灯片的另类方法(然后制作幻灯片)
-
如果提到制作幻灯片,很多人都会想到PowerPoint。PowerPoint提供了很多模板,但这些模板实用性并不太高;PowerPoint的交互还算比较方便,但不可否认,交互显得有些呆板,而且要想修改...
- 《心灵杀手重制版》画面晃动怎么办?画面晃动解决办法
-
针对《心灵杀手重制版》画面晃动的问题,以下是一些解决办法:修改游戏启动路径:打开游戏所在的文件夹,找到游戏图标的属性并打开。在属性中修改启动路径,加入“-noblur”参数。例如,如果游戏安装在“X:...
- win10网络图标是小地球怎么办(excel表格怎样选定区域打印)
-
win10系统右下角的网络图标突然变成地球图标,而且无法上网,提示无法连接到Internet。出现这个问题多数是因为win10正式版更新了或者是网络中断后遗症。那么win10网络图标变成地球怎么办呢?...
- 重看电脑任务栏的使用方法:windows 10小贴士
-
【环球科技综合报道】据日本Livedoor新闻网2月18日报道,windows电脑的屏幕下方配置的任务栏由于总是在电脑桌面上显示,可以说是一个映入眼帘的机会格外多的区域。像是把经常使用的软件图标设置在...
- 《地狱之门》攻略:四大元素属性介绍
-
《地狱之门》是近期比较看好的一款卡牌手游,绚丽的3D游戏画面以及创新的即时战斗系统给人耳目一新的感觉,非常值得一试。玩家不仅可以体验到卡牌收集的乐趣,更主要的是能体验到酣畅淋漓的战斗体验。下面小编就来...
- 一脸懵?DNF希洛克词条赋予属性触发条件解析
-
DNF希洛克已经开放两天,但是还是看到不少小伙伴对于希洛克词条赋予系统充满疑惑。今天就来简单易懂的给大家科普一下希洛克词条赋予系统里的弯弯绕绕。◎希洛克装备融合在希洛克获取了专属史诗之后,即可通过歌兰...
- 一周热门
- 最近发表
- 标签列表
-
- HTML 教程 (33)
- HTML 简介 (35)
- HTML 实例/测验 (32)
- HTML 测验 (32)
- JavaScript 和 HTML DOM 参考手册 (32)
- HTML 拓展阅读 (30)
- HTML常用标签 (29)
- HTML文本框样式 (31)
- HTML滚动条样式 (34)
- HTML5 浏览器支持 (33)
- HTML5 新元素 (33)
- HTML5 WebSocket (30)
- HTML5 代码规范 (32)
- HTML5 标签 (717)
- HTML5 标签 (已废弃) (75)
- HTML5电子书 (32)
- HTML5开发工具 (34)
- HTML5小游戏源码 (34)
- HTML5模板下载 (30)
- HTTP 状态消息 (33)
- HTTP 方法:GET 对比 POST (33)
- 键盘快捷键 (35)
- 标签 (226)
- HTML button formtarget 属性 (30)
- CSS 水平对齐 (Horizontal Align) (30)