Apache Drill 1.9.0 released: an analysis system for large datasets

Published: 2016-12-01 09:22:54. Source: 红联. Author: baihuo.
Apache Drill 1.9.0 has been released. The main changes are:

New features:

Asynchronous Parquet reader

Parquet filter pushdown

Dynamic UDF support

HTTPD format plugin

Improvements:

Implement filter pushdown for Parquet

CSV reader should allow newlines inside quotes

Make store.hive.optimize_scan_with_native_readers=true the default

Malformed JSON should not stop the entire query from progressing

Allow casting to boolean the same literals as in Postgres

Remove submit_plan script from Drill distribution

Drill should avoid doing the same join twice if count(distinct) exists

Include session options used for a query as part of the profile

Improve parquet reader performance

Add ANSI format for date/time functions

Add support for Null Equality Joins

Adding template_name to source code generated using freemarker template

Allow users to customize the Drill log file name

Use ImpersonationUtil in RemoteFunctionRegistry
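
The CSV item in the list above is easier to picture with a small example. The sketch below is not Drill's reader; it only uses Python's standard csv module to show what "newlines inside quotes" means for a parser. The sample data and the helper name read_records are hypothetical, made up for illustration.

```python
import csv
import io

# Hypothetical sample: the "note" field of the first data record
# spans two physical lines because it is wrapped in quotes.
SAMPLE = 'id,note\n1,"first line\nsecond line"\n2,plain\n'


def read_records(text):
    """Parse CSV text, keeping newlines that occur inside quoted fields."""
    # newline="" leaves line endings untouched so the csv module
    # can decide which ones end a record and which are field content.
    reader = csv.reader(io.StringIO(text, newline=""))
    return list(reader)


rows = read_records(SAMPLE)
for row in rows:
    print(row)
```

A reader that split on every line break would see four records here instead of three, which is the kind of behavior the improvement is meant to avoid.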

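To help enterprise users find more effective ways to speed up queries over Hadoop data, the Apache Software Foundation launched an open-source project named "Drill". Apache Drill is an implementation of Google's Dremel.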

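The project aims to build an open-source counterpart to Google's Dremel for the Hadoop ecosystem (Google uses Dremel to speed up data analysis for its internet-scale applications). Drill is intended to help Hadoop users query massive datasets faster.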

Download: https://drill.apache.org/blog/2016/11/29/drill-1.9-released

Via: 开源中国社区 (OSChina)