本文旨在将来自 S3 的 .csv.gz 数据,批量摄取至 Druid. 其中:
- Apache Druid:
26.0.0
- 参考文档:
REPLACE all data
1
2
3
4
5
|
REPLACE INTO <target table>
OVERWRITE ALL
< SELECT query >
PARTITIONED BY <time granularity>
[ CLUSTERED BY <column list> ]
|
REPLACE specific time ranges
1
2
3
4
5
|
REPLACE INTO <target table>
OVERWRITE WHERE __time >= TIMESTAMP '<lower bound>' AND __time < TIMESTAMP '<upper bound>'
< SELECT query >
PARTITIONED BY <time granularity>
[ CLUSTERED BY <column list> ]
|