sparksql中如何实现对Sequoiadb数组类型字段的查询 - 代码天地

sparksql中如何实现对Sequoiadb数组类型字段的查询

其他 2019-01-29 00:20:32 阅读次数: 0

Sequoiadb数据库是国产的企业级分布式数据库，Sequoiadb本身是key-value格式的nosql数据库，上层使用spark做SQL解析层，本文介绍如何使用sparksql查询Sequoiadb数组。

下面举一个具体的例子来说明：

1. 在SDB中创建集合，里面包含数据对象

db.foo.createCL("array1", {ShardingKey:{_id:1}, ShardingType:"hash", AutoSplit:true})

db.foo.array1.insert({id:1, empList: [{name:"Tom", age:30}, {name:"Jack", age:40}]})

db.foo.array1.insert({id:2, empList: [{name:"Nacy", age:25}, {name:"Wendy", age:35}]})

db.foo.createCL("array2", {ShardingKey:{_id:1}, ShardingType:"hash", AutoSplit:true})

db.foo.array2.insert({id:3, empList: [{name:"Tom", age:30}, {name:"Jack", age:40}]})

db.foo.array2.insert({id:4, empList: [{name:"Nacy", age:25}, {name:"Wendy", age:35}]})

2. 在spark-sql中创建对应的数据表：

扫描二维码关注公众号，回复： 5108344 查看本文章

CREATE table sdb_array1 ( id int, empList array<struct<name:string, age:int>>) using com.sequoiadb.spark OPTIONS ( host 'sdbserver1:11810', collectionspace 'foo', collection 'array1');

CREATE table sdb_array2 ( id int, empList array<struct<name:string, age:int>>) using com.sequoiadb.spark OPTIONS ( host 'sdbserver1:11810', collectionspace 'foo', collection 'array2');

select * from sdb_array1;

select * from sdb_array2;

注意：

基本的模式是 array<TYPE> 和 struct<COLUMN:TYPE, COLUMN:TYPE, ...>，上面的用法是两者的组合。

Hive和Spark数据类型的说明，请参考下面的文档：

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Types

https://spark.apache.org/docs/1.2.0/sql-programming-guide.html

3. 以数组中的特定信息作为查询条件：

select * from sdb_array1 where empList[0].name='Tom';

select * from sdb_array2 where empList[1].name='Wendy';

select * from sdb_array1 where empList[0].age=30;

select * from sdb_array2 where empList[1].age=35;

select * from sdb_array1 where empList[0].name='Tom' union all select * from sdb_array2 where empList[1].age=35;

猜你喜欢

转载自blog.csdn.net/u014439239/article/details/81906889

sparksql中如何实现对Sequoiadb数组类型字段的查询

Laravel 的 DB 查询是如何实现字段类型自动转的？

Oracle中如何查询CLOB字段类型的内容

SQL Server中的text类型字段要如何查询？

【巨杉数据库Sequoiadb】如何查看表字段的数据类型

mysql中关于bit类型字段的查询

mysql中存储字段类型的查询效率

SparkSql 实现两表查询

spark读取elasticsearch中数组类型的字段

sparksql查询hbase中的数据

【巨杉数据库Sequoiadb】【咨询】【数据操作】【聚集查询】在执行聚集查询时，字符类型的字段能否按照实际内容进行分组去重

hive中如何生成json类型的字段

日期类型的字段的查询

Oracle查询字段类型

golang如何构造pg, jsonb类型字段的动态查询条件

Mongo 关联查询、数组中的对象中的字段排序

mysql中int类型的字段没赋默认值为空时如何使用查询条件

mysql和mybatisPlus实现：datetime类型的字段范围查询

SQL Server 2000中查询表名,列名及字段类型

MySql 查询表中字段的数据类型

Mysql中float类型字段，=查询不出结果

使用MongoTemplate实现包含特定值的数组字段查询

【MongoDB】查询字段对应的数组中包含某个值

node中mongoose操作数组类型字段

oracle查询表字段和字段类型

mysql中SQL语句查询表字段名、注释、字段类型

sequoiadb模糊查询

java如何实现（数据库中没有对应的字段值）状态查询，启用，禁用，结束

如何查询Oracle中字段是否包含换行

Mysql 如何查询表名中包含某字段的表

今日推荐

Linus “吃狗粮”最积极！

开源日报 | Winamp播放器即将开源；生成式AI之战升级第二轮；Linus“吃狗粮”最积极；AI进入泡沫前期；吴泳铭为阿里云带来了什么？

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

周排行

SVN服务端安装在阿里云

实战 | 相机标定

webpack核心概念

note20——》只要肯低头吃苦，人生就会有救

PAT甲级 1062 Talent and Virtue （25 分）排序

NG Toolset开发笔记--5GNR Resource Grid（26）

如何对待上司

oracle命令

第9章 STL迭代器

logstash使用es映射模板

每日归档

更多

2024-05-20(36)

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)