[Live preview] Without a "professional" vector database, can the big model be unplayable?

The vector database was once unknown, but with the explosive growth of generative AI large models, its popularity has been pushed to an unprecedented peak.

Not long ago, many vector database companies announced the completion of financing: Pinecone completed $100 million in Series B financing, WeaviateBV received $50 million in Series B financing, Chroma received $18 million in seed round financing, and Qdrant received $7.5 million in seed financing. The influx of capital can prove the popularity of the vector database.

In China, in addition to Zilliz, the leader in the vector database track, companies such as Dahua, Yunchuang Data, China Software, Neusoft, Daily Interactive, Transwarp and Meiya Pico are also betting on this a field.

Why, can't the big model be played without a professional vector database ?

On August 29th, the second issue of OSCHINA [Open Source Talk] specially invited 5 representative experts in the industry to answer this question from the following perspectives in the form of live broadcast.

  • Do we really need a "professional" vector database like Milvus, or is it enough to upgrade the vector function support like PostgreSQL, Redis, ElasticSearch, etc.?

  • When an enterprise is faced with choosing a product such as a vector database or a vector plug-in, what factors should it mainly consider?

  • Combined with the actual scene, compare various vector products: data scale, retrieval performance, memory usage, cost...

  • How do vector databases, vector plug-ins, or vector retrieval libraries achieve a balance between retrieval quality and retrieval speed? Which way is the better choice?

Live topic: Without a "professional" vector database, can the large model be unplayable?

Live broadcast time: 19:00-21:30, August 29

Live broadcast platform: "OSC Open Source Community" video account

Sponsor: Open Source China

Scan the QR code on WeChat to make an appointment for the live broadcast, welcome to join the OSC live chat group, let’s chat together~

Live benefits

  • Interactive lottery: Ask questions in the comment area of ​​the live broadcast, and the users who are answered by the live broadcast guests will get 1 OSC T-shirt, and the quota is not limited.

  • Lucky bag lottery: There will be multiple rounds of lottery draws during the live broadcast, and participants will have a chance to win OSC T-shirts, notebooks, mugs, cutting-edge technology books, etc.

See you in the live broadcast~


Special thanks to the live broadcast guests and cooperative communities:

  Live host:

    Chen Tianzhou, CEO of Bytebase, former head of Google cloud database technology, former head of database/R&D/collaboration platform of Ant Group.

  Sharing guests:

  • Xiaofan Luan, partner and technical director of Zilliz, member of the Technical Advisory Committee of LF AI & Data Foundation, architect and maintainer of Milvus community.
  • Tang Cheng, co-founder of Zhongqi Multiplier Technology, MVP of PolarDB open source community, author of "PostgreSQL Practice: From Small Worker to Expert", core member of PostgreSQL Chinese Community, member of the Standing Committee of PostgreSQL China User Association
  • Wang Yan, senior technical expert of Alibaba Cloud Intelligence, head of research and development of Tair vector search engine
  • Liu Xiaoguo, Chief Evangelist of Elastic China Community

Bytebase

Bytebase is an open source database CI/CD tool for DevOps teams, designed for developers and DBAs. It is also the only Database CI/CD product included in CNCF Landscape.

Official website link: https://www.bytebase.com/

GitHub repository: https://github.com/bytebase/bytebase

Zilliz

Zilliz is the pioneer and global leader of vector database systems, developing vector database systems for enterprise-level AI applications. As the creator of Milvus, the world's most popular open source vector database, Zilliz provides a new generation of database technology for AI applications, helping enterprises to easily develop AI applications. With the democratization of AI as its mission, Zilliz is committed to building an AI database management infrastructure and empowering more enterprises through vector databases.

Zilliz Chinese official website: https://zilliz.com.cn/

The kite

Milvus is the world's most popular, fastest iterative, and most mature open source vector database. It has the largest Chinese/overseas user and developer community. It has obtained more than 20,000 Stars on GitHub and more than 3 million downloads. Trusted by more than 1,000 business users around the world.

Milvus official website: https://milvus.io/

Kite GitHub: https://github.com/milvus-io/kitve

PolarDB Open Source Community

PolarDB is a family of cloud-native database products self-developed by Alibaba Cloud. It adopts the separation of storage and computing, and the integrated design of software and hardware. It not only has the low-cost advantage of distributed design, but also has the ease of use of centralized, which can meet the needs of large-scale application scenarios. .

In 2021, Alibaba Cloud will take database open source as an important strategic direction, and officially open source its self-developed core database product PolarDB, helping developers and customers quickly use Alibaba Cloud database product technology through the open source version, and participate in the iterative process of technology products.

PolarDB open source official website: https://openpolardb.com/

Three

Tair, a cloud-native in-memory database, is a cloud-native in-memory database developed by Alibaba Cloud. On the basis of being fully compatible with Redis, it provides rich data models and enterprise-level capabilities to help customers build real-time online scenarios. At the same time, the efficient combination of Tair and a new type of storage medium—persistent memory, reduces the cost by more than 30% compared with memory, and can achieve data persistence and provide performance similar to memory. At present, Tair has been widely used by customers in various industries such as government affairs, finance, manufacturing, medical care and pan-Internet to meet customers' high-speed query and computing scenarios.

Product official website: https://www.aliyun.com/product/apsaradb/kvstore/tair

Elastic

Elastic is a search-focused company. As the developer of the Elastic Stack (Elasticsearch, Kibana, Beats, and Logstash), Elastic builds self-managed and SaaS products that enable people to work in application search, site search, enterprise search, logs, APM, metrics, security, Consume data in real time at scale in use cases such as business analytics. Elastic is used by thousands of companies worldwide to power mission-critical systems, including Cisco, eBay, Goldman Sachs, Microsoft, Mayo Medical Center, NASA, The New York Times , Wikipedia and Verizon. Elastic is a distributed company founded in 2012 with Elasticians operating in countries around the world. Elastic also established a wholly-owned company "Elastic Search (Beijing) Information Technology Co., Ltd." in China in 2018.

Official website: elastic.co/cn/

Elastic China Community Official Blog: https://elasticstack.blog.csdn.net/

Zhongqi Multiplier Technology

As an ecological enterprise of the PolarDB open source database, Zhongqi Multiplier Technology is a national high-tech enterprise with "data empowerment, value innovation" as its development strategy and business orientation. Provide services of products and technical solutions around relational databases and distributed storage related technical fields for large and medium-sized enterprise users such as finance, communications, energy, and government.

Official website: www.csudata.com

PG Chinese Community

The PostgreSQL Chinese Community is a non-profit non-governmental organization. Currently, members join as volunteers. The purpose of establishment is to build a PG database technology ecosystem (kernel, user training institutions, manufacturers, service providers, software developers, and universities to form a "business A benign development ecosystem driven by two-way driving with interests); to help enterprises solve the problems of personnel training and enterprise commercial database costs, the community will publish the latest PostgreSQL information and PostgreSQL related technical articles on various operating platforms to promote the development of PG technology in China.

PG Chinese community official website: www.postgres.cn

GreatSQL Community

The GreatSQL community was established in 2021 and initiated by Wanli Database. It is committed to building domestic independent open source database versions and open source database technologies through open community cooperation, and promoting the prosperity and development of China's open source database and application ecology. GreatSQL is a domestic independent open source database suitable for financial-level applications. It has multiple core features such as high performance, high reliability, high usability, and high security. It can be used as an optional replacement for MySQL or Percona Server for online production environments , and is completely free and compatible with MySQL or Percona Server.

Official website link: https://greatsql.cn/

Gitee repository: https://gitee.com/GreatSQL

ink sky wheel

Motianlun is a professional data technology community in China. It was founded in 2019 and currently covers 400,000 database-related practitioners in China. It provides one-stop comprehensive services around the learning and growth of data people, and creates a portal website integrating news information, technical articles, online Q&A, event live broadcast, video courses, document reading, online operation and maintenance, etc. Motianlun is committed to creating a more innovative learning form in the new era, building a complete data knowledge system, and jointly building a warm technical community and a new data community aggregate.

Link: https://www.modb.pro/

Mechanical Industry Press

Link: http://www.cmpbook.com/

Established in 1950, Machinery Industry Press was the first science and technology publishing house established by the state after the founding of the People's Republic of China. Mechanical Industry Press (hereinafter referred to as Mechanical Society) is hosted by the Machinery Industry Information Research Institute and is currently affiliated to the State-owned Assets Supervision and Administration Commission of the State Council. It is the mission and pursuit of the Machinery Society to disseminate industrial technology, craftsman skills and industrial culture, and help improve my country's independent innovation capabilities.

Redis 7.2.0 was released, the most far-reaching version Chinese programmers refused to write gambling programs, 14 teeth were pulled out, and 88% of the whole body was damaged. Flutter 3.13 was released. System Initiative announced that all its software would be open source. The first large-scale independent App appeared , Grace changed its name to "Doubao" Spring 6.1 is compatible with virtual threads and JDK 21 Linux tablet StarLite 5: default Ubuntu, 12.5-inch Chrome 116 officially released Red Hat redeployed desktop Linux development, the main developer was transferred away Kubernetes 1.28 officially released
{{o.name}}
{{m.name}}

Guess you like

Origin my.oschina.net/u/3859945/blog/10100920