Apache Flume 1.6.0 has been released and is now available for download:
http://flume.apache.org/download.html
Changes:
** Bug fixes
[FLUME-1793] - Unit test TestElasticSearchLogStashEventSerializer fails with IBM JDK
[FLUME-1934] - Spoolingdir source exception when reading multiple zero size files
[FLUME-2024] - Add ExecSource flush timeout to Flume User Guide
[FLUME-2126] - Problem in elasticsearch sink when the event body is a complex field
[FLUME-2148] - Windows : Add flume-env.ps1
[FLUME-2214] - FileChannelIntegrityTool needs documentation in user guide
[FLUME-2245] - HDFS files with errors unable to close
[FLUME-2337] - export JAVA_HOME in flume-env.sh.template and increase heap size
[FLUME-2346] - idLogFileMap in Log can lose track of file ids
[FLUME-2404] - Default maxReadBufferBytes might cause OOM and cause scribe source exit
[FLUME-2408] - Remove FLUME-1899 from Flume 1.5.0 change log.
[FLUME-2416] - Use CodecPool in compressed stream to prevent leak of direct buffers
[FLUME-2420] - HDFSEventSink#stop might throw ConcurrentModificationException
[FLUME-2425] - FileChannel should trim data and checkpoint directories
[FLUME-2432] - Update Kite dependency to 0.15.0
[FLUME-2438] - Make Syslog source message body configurable
[FLUME-2450] - Improve replay index insertion speed.
[FLUME-2466] - Embedded agent name cannot contain spaces
[FLUME-2470] - Kafka Sink and Source must use camel case for all configs.
[FLUME-2472] - SyslogUtils fails when system locale is not English
[FLUME-2475] - toLowerCase/toUpperCase used without Locale parameter to manipulate Enum
[FLUME-2479] - Kafka property auto.commit.enable is incorrect for KafkaSource
[FLUME-2480] - Typo in file channel exception: “Usable space exhaused”
[FLUME-2481] - TestFileChannelEncryption fails with System locale other than en_US
[FLUME-2482] - Race condition in File Channels’ Log.removeOldLogs
[FLUME-2486] - TestExecSource fails on some environments
[FLUME-2487] - SyslogParser has rounding errors in timestamp parsing
[FLUME-2488] - TestElasticSearchRestClient fails on Oracle JDK 8
[FLUME-2489] - Upgrade Apache Curator to 2.6.0
[FLUME-2492] - Flume’s Kafka Source doesn’t account time correctly
[FLUME-2495] - Kafka Source may miss events when channel is not available
[FLUME-2497] - TCP and UDP syslog sources parsing the timestamp incorrectly
[FLUME-2500] - Add a channel that uses Kafka
[FLUME-2505] - Test added in FLUME-2502 is flaky
[FLUME-2517] - Performance issue: SimpleDateFormat constructor takes 30% of HDFSEventSink.process()
[FLUME-2525] - flume should handle a zero byte .flumespool-main.meta file for the spooldir source
[FLUME-2538] - TestResettableFileInputStream fails on JDK 8
[FLUME-2556] - TestBucketPath.testDateFormatHours fails intermittently
[FLUME-2557] - DatasetSink doesn’t work with Kerberos
[FLUME-2560] - Kafka channel entries missing for output jar generation
[FLUME-2568] - Additional fix for TestReliableSpoolingFileEventReader
[FLUME-2578] - Kafka source throws NPE if Kafka record has null key
[FLUME-2586] - HDFS Sink should have an option to try rename even if close fails
[FLUME-2592] - Specify main manifest attribute in flume tools jar
[FLUME-2594] - Close Async HBase Client if there are large number of consecutive timeouts
[FLUME-2595] - Add option to checkpoint on file channel shutdown
[FLUME-2624] - Improve Hive Sink performance
[FLUME-2626] - Remove trustmanager-type from Thrift RPC client
[FLUME-2633] - Update Kite dependency to 1.0.0
[FLUME-2639] - SecureThriftRpcClient client privileged calls throw IllegalAccessError
[FLUME-2645] - ipFilter.rules property name is wrong
[FLUME-2654] - Memory channel error message related to byteCapacity needs refining
[FLUME-2657] - Upgrade to Hive 1.0
[FLUME-2658] - Upgrade to Hbase 1.0
[FLUME-2664] - Site profile fails due to dependency issues
[FLUME-2665] - Update documentation for hdfs.closeTries based on FLUME-2586
[FLUME-2670] - Modify dependencies to not pull hadoop/hive related jars
[FLUME-2673] - Remove unused import in TestLog
[FLUME-2678] - Upgrade xalan to 2.7.2 to take care of CVE-2014-0107 vulnerability
[FLUME-2679] - Make hbase-1 the default profile
** Dependency upgrades
[FLUME-2443] - org.apache.hadoop.fs.FSDataOutputStream.sync() is deprecated in hadoop 2.4
** Documentation
[FLUME-1594] - Document JMX options
[FLUME-2509] - Add description for properties in Failover Sink Processor
[FLUME-2630] - Update documentation for Thrift Src/Sink SSL support and Kerberos support
[FLUME-2635] - Documentation for zookeeper dynamic configuration in flume
[FLUME-2668] - Document SecureThriftRpcClient/SecureRpcClientFactory in Flume Developer Guide
** Improvements
[FLUME-1334] - Write an startscript for flume agents on Windows
[FLUME-1491] - Dynamic configuration from Zookeeper watcher
[FLUME-1521] - Document the StressSource
[FLUME-1710] - JSONEvent.getBody should not return null
[FLUME-1920] - Test case TestFileChannel fails when flume is built from paths containing the string “hadoop”
[FLUME-2237] - Example config in User Guide to use same notation for Agent name
[FLUME-2246] - event body data size can make it configurable for logger sinker
[FLUME-2273] - ElasticSearchSink: Add handling for header substitution in indexName
[FLUME-2373] - Enable configuration to switch thrift source serialization protocol
[FLUME-2385] - Flume spans log file with “Spooling Directory Source runner has shutdown” messages at INFO level
[FLUME-2401] - Optionally Compress Backup Checkpoint
[FLUME-2439] - Update DatasetSink for Kite 0.15.0
[FLUME-2462] - Remove use of deprecated methods in DatasetSink
[FLUME-2463] - Add support for Hive and HBase datasets to DatasetSink
[FLUME-2469] - DatasetSink should load dataset when needed, not at startup
[FLUME-2499] - Include Kafka Message Key in Event Header, Updated Comments
[FLUME-2502] - Spool source’s directory listing is inefficient
[FLUME-2558] - Update javadoc for StressSource
[FLUME-2562] - Metrics for Flafka components
[FLUME-2591] - Kite DatasetSink 2.0
[FLUME-2613] - Tool/script for deleting individual message from queue
[FLUME-2642] - Limit kerberos relogin attempt
** New features
[FLUME-1734] - Create a Hive Sink based on the new Hive Streaming support
[FLUME-2242] - Flume Sink and Source for Apache Kafka
[FLUME-2426] - Support interceptors in the Embedded Agent
[FLUME-2431] - Add simple regex search-and-replace interceptor
[FLUME-2570] - Add option to not pad date fields
[FLUME-2574] - SSL Support for Thrift Rpc
[FLUME-2631] - End to End authentication in Flume
Changelog and documentation:
http://flume.apache.org/releases/1.6.0.html
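Flume is a distributed, reliable, and highly available service for collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on a streaming data flow model, and it is robust and fault tolerant.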
Source: OSChina (开源中国社区)

