Extracting Image Features with transformers and Storing Them in Pinecone

Category: Enterprise Development | Published: 2023-07-21 03:59:28

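I. Overview

1. Connect to a Pinecone index, or create one

2. Load the CLIP model from disk, or download it

3. Load the image

4. Extract the image's feature vector with transformers

5. Store the vector in the Pinecone index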

II. Code Example

# Third-party dependencies: pinecone-client (2.x API), torch, Pillow, transformers
import pinecone
import torch
from PIL import Image
from transformers import CLIPProcessor, CLIPModel

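# Initialize the Pinecone client (pinecone-client 2.x API; later releases replace pinecone.init with a Pinecone class)
pinecone.init(api_key="Your-API-KEY", environment="Your-environment")

# If you do not have an account yet, sign up at https://www.pinecone.io/ (a VPN/proxy may be needed)

# Connect to the index, creating it first if it does not exist
# (512 is the embedding size of openai/clip-vit-base-patch16)
index_name = "img-index"
print(index_name)
if index_name not in pinecone.list_indexes():
    pinecone.create_index(index_name, dimension=512, metric="cosine")
index = pinecone.Index(index_name=index_name)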

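# Load the CLIP model from disk, or download it on first use
model_path = r"D:\Desktop\clip_model.pt"
model_name = 'openai/clip-vit-base-patch16'
try:
    # Try the local copy first.
    # Note: PyTorch 2.6+ defaults to weights_only=True, so loading a fully
    # pickled model there needs torch.load(model_path, weights_only=False).
    clip_model = torch.load(model_path)
    clip_processor = CLIPProcessor.from_pretrained(model_name)
except FileNotFoundError:
    # No local copy: download the model and save it (a VPN/proxy may be needed)
    clip_processor = CLIPProcessor.from_pretrained(model_name)
    clip_model = CLIPModel.from_pretrained(model_name)
    torch.save(clip_model, model_path)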


# Load the image
image_path = r'D:\Desktop\tp.png'
print(image_path)

# The full CLIP forward pass also needs a text input; a placeholder prompt is enough here
text = ["img"]

# Preprocess the image and text
inputs = clip_processor(text=text, images=Image.open(image_path), return_tensors="pt", padding=True, truncation=True)

# Forward pass through the model
with torch.no_grad():
    outputs = clip_model(**inputs)

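# Get the image embedding
# image_embeds is the image's vector in CLIP's shared image-text embedding space (512 values per image for this model); it can be used for image similarity and other image retrieval tasks
# logits_per_image holds the scaled image-text similarity scores, one per text prompt; a softmax over them gives probabilities across the prompts
# The result is a nested list with one row per input image
image_vectors = outputs.image_embeds.numpy().tolist()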

print(image_vectors[0])

vectors = [
    (
        "vec22",  # Vector ID
        image_vectors[0],  # Dense vector values: a flat list of floats, not the nested batch
        {"path": image_path}  # Vector metadata
    )
]
print(vectors)

# Upsert the vector into the index
upsert_response = index.upsert(
    vectors=vectors,
    namespace="img-namespace"
)
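
The script above stores a single image. When many images are indexed, it helps to validate each vector and send the records in batches. The following is a minimal sketch of that idea, not part of the original post: the helper names `build_record`, `chunked`, and `upsert_in_batches` are hypothetical, the batch size of 100 is an assumed default, and `index` is assumed to be any object exposing the same `upsert(vectors=..., namespace=...)` call used above.

```python
from typing import Any, Dict, Iterator, List, Sequence, Tuple

# One Pinecone record in tuple form: (id, values, metadata)
Record = Tuple[str, List[float], Dict[str, Any]]


def build_record(vector_id: str, embedding: Sequence, path: str, dim: int = 512) -> Record:
    """Return an (id, values, metadata) tuple holding a flat list of floats."""
    values = list(embedding)
    # CLIP returns a batch, so unwrap a single-row nested list such as [[...]].
    if len(values) == 1 and isinstance(values[0], (list, tuple)):
        values = list(values[0])
    if len(values) != dim:
        raise ValueError(f"expected {dim} values, got {len(values)}")
    return (vector_id, [float(v) for v in values], {"path": path})


def chunked(records: Sequence[Record], size: int) -> Iterator[List[Record]]:
    """Yield consecutive slices of at most `size` records."""
    for start in range(0, len(records), size):
        yield list(records[start:start + size])


def upsert_in_batches(index: Any, records: Sequence[Record],
                      namespace: str, batch_size: int = 100) -> int:
    """Upsert the records batch by batch and return how many were sent."""
    sent = 0
    for batch in chunked(records, batch_size):
        index.upsert(vectors=batch, namespace=namespace)
        sent += len(batch)
    return sent
```

With the variables from the script, `build_record("vec22", image_vectors, image_path)` would produce the same tuple as the `vectors` entry, and `upsert_in_batches(index, [record], "img-namespace")` would send it. The dimension check fails early on the client side if a vector has the wrong length.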

Reposted from blog.csdn.net/xun527/article/details/131808434
