
dataset sink support non-avro #191


Open: wants to merge 387 commits into base: flume-1.6

Commits
0bbd0ad
FLUME-2250. Kafka Source.
harishreedharan Sep 17, 2014
186a3b8
FLUME-2455. Kafka Sink Documentation.
harishreedharan Sep 24, 2014
bde2c28
FLUME-2470. Kafka Sink and Source must use camel case for all configs.
harishreedharan Sep 24, 2014
26444fd
FLUME-2470. Kafka Sink and Source config updates. Missed a couple fil…
harishreedharan Sep 24, 2014
ef2b089
FLUME-2469. DatasetSink should load dataset when needed, not at startup.
harishreedharan Sep 25, 2014
6d02431
FLUME-2463. Add Hive and HBase dataset support in DatasetSink.
harishreedharan Sep 26, 2014
d2cb4ba
FLUME-1501. Flume Scribe Source needs unit tests.
harishreedharan Sep 30, 2014
e1e5ace
FLUME-2481. TestFileChannelEncryption fails with System locale other …
harishreedharan Oct 2, 2014
9f75c40
FLUME-2492. Flume's Kafka Source doesn't account time correctly
harishreedharan Oct 3, 2014
e1cb691
FLUME-2479. Kafka property auto.commit.enable is incorrect for KafkaS…
harishreedharan Oct 3, 2014
9b2c28d
FLUME-2479. Adding new test files.
harishreedharan Oct 3, 2014
9f0abea
FLUME-2495. Kafka Source may miss events when channel is not available
harishreedharan Oct 7, 2014
9968f0f
FLUME-2466. Embedded agent name cannot contain spaces
harishreedharan Oct 9, 2014
35cf900
FLUME-2346. idLogFileMap in Log can lose track of file ids.
harishreedharan Oct 9, 2014
72b54c9
FLUME-2472. SyslogUtils fails when system locale is not English
harishreedharan Oct 10, 2014
7ffbd86
FLUME-2489. Upgrade Apache Curator to 2.6.0
harishreedharan Oct 10, 2014
f17c7d5
FLUME-2475. toLowerCase/toUpperCase must specify Locale.
harishreedharan Oct 10, 2014
84c526f
FLUME-2237. Example config in User Guide to use same notation for Age…
Oct 11, 2014
8328bcc
FLUME-2126. Problem in elasticsearch sink when the event body is a co…
harishreedharan Oct 15, 2014
aa6fb7f
FLUME-1710. JSONEvent.getBody should not return null
harishreedharan Oct 15, 2014
a582c10
FLUME-2482. File Channel tests must disable scheduled checkpoint to a…
harishreedharan Oct 15, 2014
f979b26
FLUME-2502. Improve Spool Directory Source's performance by not listi…
harishreedharan Oct 16, 2014
622faa6
FLUME-2499. Include Kafka Message Key in Event Headers.
harishreedharan Oct 16, 2014
f99adaa
FLUME-2486. TestExecSource fails on some environments
harishreedharan Oct 16, 2014
3751434
FLUME-2511. Allow configuration of enabled protocols in Avro source a…
harishreedharan Oct 23, 2014
77d56e9
FLUME-2517. Cache SimpleDataFormat objects in bucketwriter for better…
harishreedharan Oct 27, 2014
ea2fcda
FLUME-2520: HTTP Source should be able to block a prefixed set of pro…
Oct 27, 2014
aef02df
FLUME-2500: Add a channel that uses Kafka
Oct 28, 2014
fdd61e8
FLUME-2523. Document Kafka channel
harishreedharan Oct 31, 2014
7e21ad3
FLUME-2533: HTTPS tests fail on Java 6
Nov 6, 2014
efbf87f
FLUME-2525. Handle a zero byte .flumespool-main.meta file for the spo…
harishreedharan Nov 6, 2014
543c4c4
Updating CHANGELOG for 1.5.1 release
harishreedharan Nov 6, 2014
29e31bf
Preparing for Flume-1.5.1 Release
harishreedharan Nov 6, 2014
76ddc82
FLUME-2505: Test added in FLUME-2502 is flaky
Nov 8, 2014
534fe26
FLUME-2497. Support fractional seconds in Syslog timestamps
mpercy Nov 11, 2014
2c18533
FLUME-2385. Remove incorrect log message at INFO level in Spool Direc…
harishreedharan Nov 11, 2014
8c7f693
FLUME-2548: Enable SSLv2Hello for Avro Source and NettyAvroRpcClient
Nov 12, 2014
94b25ae
FLUME-2549: Enable SSLv2Hello for HttpSource
Nov 12, 2014
293a8c0
FLUME-2551. Add dev-support directory to source tarball.
harishreedharan Nov 12, 2014
2b2ac63
Update RELEASE-NOTES for 1.5.2
harishreedharan Nov 12, 2014
619e78f
Update CHANGELOG for 1.5.2
harishreedharan Nov 12, 2014
698f0ba
FLUME-2538. TestResettableFileInputStream fails on JDK 8.
harishreedharan Nov 13, 2014
f20e0a9
FLUME-2509. Add description for properties in Failover Sink Processor
Nov 20, 2014
35de8ff
FLUME-1594. Document JMX options
Nov 20, 2014
a73b678
FLUME-1521. Document the StressSource
Nov 20, 2014
55a4666
FLUME-2487. SyslogParser has rounding errors in timestamp parsing
Nov 20, 2014
22cd390
FLUME-2480. Typo in file channel exception: "Usable space exhaused"
Nov 20, 2014
ac999be
FLUME-2246. Make event data size configurable for logger sinker
Nov 20, 2014
9dd7d95
FLUME-2560. Add Kafka Channel to dist tarballs.
harishreedharan Nov 21, 2014
cec64a5
FLUME-2557. Make DatasetSink work with Kerberos
mpercy Nov 21, 2014
d1227fe
FLUME-2488: TestElasticSearchRestClient fails on Oracle JDK 8
Nov 22, 2014
1003d1f
FLUME-2558. Update javadoc for StressSource
Nov 26, 2014
34e9bda
FLUME-2373. Support TBinaryProtocol in Thrift RPC.
harishreedharan Dec 18, 2014
8446566
FLUME-2556. BucketWriter DateFormat test fails intermittently.
harishreedharan Dec 18, 2014
199684b
FLUME-2578. Kafka source throws NPE if Kafka record has null key
harishreedharan Dec 30, 2014
1d9bab6
FLUME-2562. Add metrics for Kafka Source, Kafka Sink and Kafka Channel.
harishreedharan Jan 17, 2015
91c5880
FLUME-2568. Additional fix for TestReliableSpoolingFileEventReader
harishreedharan Jan 17, 2015
82631f8
FLUME-2594: Close Async HBase Client if there are large number of con…
Jan 21, 2015
0d6ecca
FLUME-2586. HDFS Sink must try to rename files even if close fails.
harishreedharan Jan 28, 2015
1d49ef7
FLUME-2591. DatasetSink 2.0
harishreedharan Jan 29, 2015
62a4cad
FLUME-1334. Windows startup scripts.
harishreedharan Jan 31, 2015
9907764
FLUME-2443: org.apache.hadoop.fs.FSDataOutputStream.sync() is depreca…
Feb 3, 2015
a7f9255
FLUME-1734. Add a Hive Sink based on Hive Streaming support.
harishreedharan Feb 10, 2015
6f6f69b
FLUME-2592. Fix help message in File Channel Integrity Tool.
harishreedharan Feb 12, 2015
a103a67
FLUME-2574. SSL support for Thrift RPC.
harishreedharan Feb 19, 2015
4a91456
FLUME-2214. File Channel Integrity Toll documentation.
harishreedharan Feb 19, 2015
407874b
FLUME-2633: Update Kite dependency to 1.0.0
Feb 25, 2015
3d03053
FLUME-2570. Add option to not pad date fields.
harishreedharan Feb 26, 2015
542b169
FLUME-2631. End to End authentication in Flume
harishreedharan Mar 6, 2015
49150fe
FLUME-2639. SecureThriftRpcClient client privileged calls throw Illeg…
harishreedharan Mar 10, 2015
39bc4da
FLUME-2642. Limit the number of calls to UGI.checkTGTAndRelogin method.
harishreedharan Mar 11, 2015
e60c22a
FLUME-2626. Remove trustmanager-type from Thrift RPC client and Thrif…
harishreedharan Mar 13, 2015
e98bcd4
FLUME-2630: Update documentation for Thrift SRc/Sink SSL support
Mar 14, 2015
c9289c0
FLUME-2624 : Improve Hive Sink performance
Apr 1, 2015
4e06f6f
FLUME-2657: Upgrade to Hive 1.0
Apr 7, 2015
4d2a34e
FLUME-1934. Spooling Directory Source dies on encountering zero-byte …
harishreedharan Apr 7, 2015
cfefda1
FLUME-2654. Clarify error message in MemoryChannel when byte capacity…
harishreedharan Apr 7, 2015
91ec579
FLUME-2613. Add support in FileChannelIntegrityTool to remove invalid…
harishreedharan Apr 7, 2015
c77f1ac
FLUME-2649. Elasticsearch sink doesn't handle JSON fields correctly
harishreedharan Apr 10, 2015
c3eff4b
FLUME-2664. Site profile fails due to dependency issues
harishreedharan Apr 10, 2015
b68f599
FLUME-2635. Documentation for zookeeper dynamic configuration in flume
harishreedharan Apr 10, 2015
fc03456
FLUME-2645. ipFilter.rules property name is wrong
harishreedharan Apr 10, 2015
be4ae29
FLUME-2595. Add option to checkpoint on file channel shutdown
harishreedharan Apr 13, 2015
a508d95
FLUME-2668. Document SecureThriftRpcClient/SecureRpcClientFactory in …
harishreedharan Apr 14, 2015
94f568b
FLUME-2148. Windows : Add flume-env.ps1
harishreedharan Apr 14, 2015
4188e4d
FLUME-2665. Update documentation for hdfs.closeTries based on FLUME-2586
harishreedharan Apr 14, 2015
275329c
FLUME-2670. Modify dependencies to not pull hadoop/hive related jars
harishreedharan Apr 15, 2015
364752a
FLUME-2673. Remove unused import in TestLog
harishreedharan Apr 15, 2015
019ab52
FLUME-2658. Upgrade to Hbase 1.0
harishreedharan Apr 15, 2015
651559f
FLUME-2675. Update pom.xml version in trunk and 1.7 branch
harishreedharan Apr 16, 2015
b92fbdc
FLUME-2676. Update flume docs and release notes to 1.7-SNAPSHOT in tr…
harishreedharan Apr 16, 2015
e1740c0
FLUME-2652. Documented transaction handling semantics incorrect in de…
harishreedharan Apr 16, 2015
0b72ca2
FLUME-2678. Refactor Hive dependencies.
harishreedharan Apr 17, 2015
8443062
FLUME-2679. Make hbase-1 the default profile
harishreedharan Apr 17, 2015
2f2c498
FLUME-2683. Update License file for 1.6 release
harishreedharan Apr 20, 2015
848f8ee
FLUME-2649. Elasticsearch sink doesn't handle JSON fields correctly
harishreedharan Apr 21, 2015
493976e
FLUME-2690. Update ChangeLog for 1.6 Release
harishreedharan Apr 28, 2015
f756003
FLUME-2690. Update ChangeLog for 1.6 Release
harishreedharan May 5, 2015
f3b6ceb
FLUME-2552 - add Roshan Naik in developers section in pom.xml
May 6, 2015
344e0ac
FLUME-2215. ResettableFileInputStream can't support ucs-4 character
harishreedharan May 27, 2015
d6943a6
FLUME-2575. FLUME-2548 brings SSLv2Hello back for Avro Sink, but UG s…
Jul 3, 2015
b5e102b
FLUME-2729. Allow pollableSource backoff times to be configurable
Jul 6, 2015
2ddc366
Patch for FLUME-2713
Jul 7, 2015
48126c2
FLUME-2732. Make maximum tolerated failures before shutting down and …
harishreedharan Jul 8, 2015
38f0b31
FLUME-2738. Fix file descriptor leak in AsyncHBaseSink when HBase clu…
harishreedharan Jul 9, 2015
5f6d6c3
FLUME-2737: Documentation for Pollable Source config parameters intro…
Jul 14, 2015
4d71247
FLUME-2628. Add an optional parameter to specify the expected input t…
Jul 14, 2015
baca45a
FLUME-2660. Add documentation for EventValidator
Jul 24, 2015
a494611
FLUME-2749. Fix kerberos configuration error when using short names i…
harishreedharan Jul 28, 2015
fd32492
FLUME-2660. Add documentation for EventValidator. Fixed doc formattin…
Jul 29, 2015
5239b61
FLUME-2485. Thrift Source tests fail on Oracle JDK 8
Aug 5, 2015
32ef64d
FLUME-2753 Error when specifying empty replace string in Search and R…
Aug 6, 2015
757a560
FLUME-2498. Implement Taildir Source
Aug 18, 2015
fff13b5
FLUME-2763. flume_env script should handle jvm parameters like -javaa…
Aug 20, 2015
318da20
FLUME-2754 - Hive Sink skipping first transaction in each Batch of Hi…
Aug 26, 2015
67189ca
FLUME-2672. NPE in KafkaSourceCounter
Sep 16, 2015
2cc93b7
FLUME-2095. JMS source with TIBCO
Sep 21, 2015
3fccd24
FLUME-2773. TailDirSource throws FileNotFound Exception if ~/.flume d…
harishreedharan Sep 25, 2015
bd80c5e
FLUME-2804. Hive sink - abort remaining transactions on shutdown
Sep 29, 2015
a2f55e1
FLUME-2751: Upgrade Derby version to 10.11.1.1
Sep 30, 2015
d8d97db
FLUME-2734: Kafka Channel timeout property is overridden by default v…
Sep 30, 2015
c8eb221
FLUME-2798. Malformed Syslog messages can lead to OutOfMemoryException
Oct 2, 2015
67ed62a
FLUME-2781. Kafka Channel with parseAsFlumeEvent=true should write da…
harishreedharan Oct 9, 2015
d6bf08b
FLUME-2632: High CPU on KafkaSink
Oct 21, 2015
0e40e83
FLUME-2593. ResettableFileInputStream returns negate values from read…
Oct 23, 2015
8bb5566
FLUME-2712. Optional channel errors slows down the Source to Main cha…
harishreedharan Oct 30, 2015
f38a521
FLUME-2835. Hive Sink tests need to create table with transactional p…
Nov 3, 2015
88b3fee
FLUME-2841: Upgrade commons-collections to 3.2.2
Nov 18, 2015
0421fa2
FLUME-2801. Performance improvement on TailDir source
Dec 17, 2015
a8dd873
FLUME-2806. flume-ng.ps1 Error running script to start an agent on Wi…
Dec 29, 2015
e6416a0
FLUME-2718: HTTP Source to support generic Stream Handler
Jan 15, 2016
af63d38
FLUME-2704. Configurable poll delay for spooling directory source
Jan 17, 2016
7962ce6
FLUME-2875. Allow RollingFileSink to specify a file prefix and a file…
harishreedharan Feb 9, 2016
de6ecf4
FLUME-2881. Windows Launch Script fails in plugins dir code
Feb 17, 2016
109ec30
FLUME-2886: Optional Channels can cause OOMs
Feb 23, 2016
ffb52b9
FLUME-2889. Fixes to DateTime computations
Feb 26, 2016
caa64a1
FLUME-2891: Revert FLUME-2712 and FLUME-2886
Mar 9, 2016
5293eba
FLUME-2897: AsyncHBase sink NPE when Channel.getTransaction() fails
Mar 29, 2016
f8abaf7
FLUME-2821: Flume-Kafka Source with new Consumer
Mar 29, 2016
7f588e6
FLUME-2822: Flume-Kafka-Sink with new Producer
Mar 29, 2016
e8c4a7b
FLUME-2823: Flume-Kafka-Channel with new APIs
Mar 29, 2016
4eb2a3b
FLUME-2852: Kafka Source/Sink should optionally read/write Flume records
Apr 21, 2016
e6df16d
FLUME-2889: Fixes to DateTime computations
Apr 25, 2016
46ae5ac
FLUME-2908: NetcatSource - SocketChannel not closed when session is b…
May 5, 2016
37ddd1b
FLUME-2909. Upgrade RAT to 0.11
mpercy May 18, 2016
f4f15be
FLUME-2913. Don't strip SLF4J from imported classpaths
mpercy May 27, 2016
a619cc1
FLUME-2915: The kafka channel using new APIs will be stuck when the s…
Jun 10, 2016
f46bee0
FLUME-2920: Kafka Channel Should Not Commit Offsets When Stopping
Jun 10, 2016
ee4999b
FLUME-2910. AsyncHBaseSink: Failure callbacks should log the exceptio…
mpercy Jun 10, 2016
9601f5b
User guide: fix mistake and formatting
dylan8902 Jun 14, 2016
1b9e589
FLUME-2923. Bump asynchbase version to 1.7.0
mpercy Jun 17, 2016
7d1e683
FLUME-2918. Speed up TaildirSource on directories with many files
mpercy Jun 20, 2016
4a7db4a
FLUME-2934. Document new cachePatternMatching option for TaildirSource
mpercy Jun 24, 2016
2f731f0
FLUME-2935. Bump java target version to 1.7
mpercy Jun 26, 2016
af4ea3f
Move README.textile to README.md and add license header
mpercy Jun 27, 2016
4eacba1
FLUME-2567. Remove external Maven repositories
smola Dec 3, 2014
2fe3938
FLUME-1899. Make SpoolDir work with subdirectories
mpercy Jun 28, 2016
2252fb1
FLUME-2937. Integrate checkstyle for non-test classes
mpercy Jun 26, 2016
c8c0f9b
Minor README tweaks
mpercy Jun 30, 2016
cfbf115
FLUME-2941. Integrate checkstyle for test classes
mpercy Jun 30, 2016
7013708
FLUME-2939. Update recursive SpoolDir source to use Java 7 APIs
mpercy Jul 9, 2016
19664c3
FLUME-2948. docs: Fix parameters on Replicating Channel Selector example
pippobaudos Jul 8, 2016
188c310
FLUME-2949. checkstyle: Make suppressions compatible with Windows
mpercy Jul 9, 2016
209c1e2
FLUME-2936. KafkaSource tests: Use loopback instead of host address
mpercy Jul 9, 2016
ba64b12
FLUME-2950. ReliableSpoolingFileEventReader.rollCurrentFile is broken
liorze Jul 10, 2016
ec28b66
FLUME-2725. HDFS Sink does not use configured timezone for rounding
adenes Jul 13, 2016
c7de4ba
FLUME-2952. SyslogAgent: Avoid NPE on stop()
mpercy Jul 12, 2016
dbf2e98
FLUME-2810. Add static Schema URL to AvroEventSerializer configuration
jholoman Jul 19, 2016
0259d30
Fix broken link in README
mpercy Jul 19, 2016
9965dae
Fix sphinx layout errors
mpercy Jul 19, 2016
358bb67
FLUME-2922. Sync SequenceFile.Writer before calling hflush
Jun 9, 2016
1ca0765
FLUME-2955. Add file path to the header in TaildirSource
tinawenqiao Jul 20, 2016
c554ba3
FLUME-2514. Speed up TestFileChannelRestart tests
smola Jul 20, 2016
10639e8
FLUME-2761. Move Hive sink out of preview mode
Jul 20, 2016
5a083a3
FLUME-2890. Typo in Twitter source warning
ccpds Jul 26, 2016
988ede9
FLUME-2959. Fix issues with flume-checkstyle module
liorze Jul 27, 2016
dff1505
Fix various typos
lfzCarlosC Jul 29, 2016
4b74aa2
FLUME-2963. FlumeUserGuide: Fix error in Kafka Source properties table
adenes Jul 29, 2016
1422f73
FLUME-2619. Spooldir source should log channel exceptions
Aug 9, 2016
c1fae53
FLUME-2901. Document Kerberos setup for Kafka channel
Aug 9, 2016
b6dede8
FLUME-2318: Make SpoolingDirectorySource able to handle empty files
Aug 17, 2016
118d5a4
FLUME-2844. SpillableMemoryChannel must start ChannelCounter
adenes Aug 17, 2016
e0deb72
FLUME-2975. docs: Fix NetcatSource example
Aug 21, 2016
9eb92da
FLUME-2974. Fix some broken SpoolDirSource unit tests
Aug 22, 2016
1e8f265
FLUME-2972. Handle offset migration in the new Kafka Channel
granthenke Aug 25, 2016
25e4bc6
FLUME-2954. Make raw data appearing in log messages explicit
Aug 29, 2016
53fb374
Upgrade to ASF pom version 18
mpercy Aug 30, 2016
330e572
FLUME-2983. Handle offset migration in the new Kafka Source
granthenke Sep 1, 2016
a0a5084
FLUME-2982. Add localhost escape sequence to HDFS sink
jholoman Sep 1, 2016
d9c9a7d
FLUME-2797. Use SourceCounter for SyslogTcpSource
adenes Sep 8, 2016
591b138
FLUME-2966. Fix NPE in JMS Source
c-sahuja Sep 26, 2016
fca8c5b
FLUME-2998. Add a missing configuration parameter to SequenceSource docs
Sep 30, 2016
27c7258
FLUME-3003. Fix flaky testSourceCounter in TestSyslogUdpSource
adenes Oct 7, 2016
7d5ceac
FLUME-2911. Add include pattern option in SpoolDir source
andrearota Aug 4, 2016
585c4c9
FLUME-2999. Kafka channel and sink should enable statically assigned …
Oct 10, 2016
2fd0d25
FLUME-2971. Add secure Kafka Sink/Source/Channel setup to the User Guide
Oct 10, 2016
d40abcb
Update RELEASE-NOTES for 1.7.0
bessbd Oct 10, 2016
5e400ea
Update CHANGELOG for 1.7
bessbd Oct 10, 2016
bb0eba0
FLUME-2924. Update POM versions to 1.8.0-SNAPSHOT
bessbd Oct 10, 2016
05feab4
Add flume-shared module to assembly xmls
adenes Oct 11, 2016
14ab61e
Update LICENSE
Oct 12, 2016
87d4c2c
Remove test dependencies from binary release artifact
mpercy Oct 12, 2016
94f1fab
Fix startup order for ExecSource
mpercy Oct 13, 2016
d6a73c8
Add .travis.yml
bessbd Oct 22, 2016
132063b
Fix .travis.yml
bessbd Oct 22, 2016
4b44dfc
Set MAVEN_OPTS for Travis-CI
bessbd Oct 24, 2016
2399329
FLUME-3002. Fix tests in TestBucketWriter
bessbd Oct 24, 2016
9b65219
Add Developer Section / How to Release from cwiki.apache.org
bessbd Oct 24, 2016
7c4b2fd
FLUME-2997. Fix flaky test in SpillableMemoryChannel
Oct 27, 2016
beddc40
Add Developer Section / How to Contribute from cwiki.apache.org
bessbd Oct 25, 2016
68f955d
Add Developer Section / How to Commit from cwiki.apache.org
bessbd Oct 28, 2016
5bf1d9b
Add Developer Section / Developers Quick Hack Sheet from cwiki.apache…
bessbd Oct 25, 2016
14fb4d8
FLUME-2857. Make Kafka Source/Channel/Sink restore default values whe…
Oct 28, 2016
911de00
Add e-mail templates to HowToRelease.md
bessbd Oct 25, 2016
f581f6e
FLUME-2812. Fix semaphore leak causing java.lang.Error: Maximum permi…
adenes Oct 28, 2016
9868c42
FLUME-3020. Improve HDFS Sink escape sequence substitution
TedBear42 Nov 9, 2016
72a68f0
Remove hostHeader = hostname property from Host interceptor example
peterableda Nov 18, 2016
de941e7
FLUME-2171. Add Interceptor to remove headers from event
gcommeau Nov 20, 2016
21a64db
FLUME-3025. Expose FileChannel.open on JMX
adenes Nov 21, 2016
83632c6
Fix KafkaSource and KafkaSink example configurations in the user guide
Nov 23, 2016
31c729e
Clean up references to the Flume 0.9 TailSource and recommendations t…
peterableda Nov 18, 2016
ed9f6ff
Fix Remove Header Interceptor properties table in FlumeUserGuide.rst
Nov 30, 2016
fa1ee05
FLUME-3027. Change Kafka Channel to clear offsets map after commit
jholoman Nov 23, 2016
2ff2dbb
FLUME-3031. Change sequence source to reset its counter for event bod…
Nov 22, 2016
5e52ac4
FLUME-2993. Add support for environment variables in configuration files
bessbd Jan 6, 2017
a844914
Add an HTTP sink
Jan 24, 2017
18453d3
Change how we build and sign release artifacts
mpercy Oct 12, 2016
f215374
FLUME-3049. Make HDFS sink rotate more reliably in secure mode
adenes Jan 27, 2017
6300832
FLUME-2427. Be less verbose in isFileClosed() notification
wang1026 Feb 17, 2017
4b6b841
Add Impala and Hive recommendation on hdfs.writeFormat setting in the…
Nov 18, 2016
73d8744
flume-checkstyle module should use ASF root pom as parent
mpercy Feb 24, 2017
e5c3e6a
FLUME-3080. Close failure in HDFS Sink might cause data loss
adenes Apr 3, 2017
03c8357
FLUME-2817. docs: Fix multi-agent flow example
marcellhegedus Mar 29, 2017
c718dae
FLUME-3043. Fix NPE in Kafka Sink and Channel
loleek Apr 16, 2017
dfa0627
FLUME-3083. Check byte position of file in update condition of Taildi…
Apr 9, 2017
ed433ae
FLUME-3085. HDFS Sink can skip flushing some BucketWriters, might lea…
adenes Apr 20, 2017
fdc53f3
FLUME-3092. Extend the FileChannel's monitoring metrics
adenes May 9, 2017
83e2569
FLUME-3100. Support arbitrary header substitution for topic of Kafka …
stakafum May 25, 2017
33d1904
FLUME-2945. Bump java target version to 1.8
liorze Jun 20, 2017
f2996cc
FLUME-3072. Add IP address to headers in Flume Log4j appender
Mar 15, 2017
857df3f
FLUME-2579 Support durable subscription in JMSSource
Mar 14, 2017
d01dfd3
Compile with jdk8 on Travis CI
Jun 29, 2017
b5e5ba5
FLUME-2752. Fix AvroSource startup resource leaks
Jun 29, 2017
5e9cfef
FLUME-2905. Fix NetcatSource file descriptor leak if startup fails
c-sahuja Jun 30, 2017
ef54f9f
FLUME-2917. Provide netcat UDP source as alternative to TCP
Jul 2, 2017
b54f085
FLUME-2175. Document Protocol Buffer upgrade steps
Jul 4, 2017
964bcf5
Facelift AvroSource and test using lambdas
Jun 29, 2017
c570a51
Fix HttpSink bad response handling
filippovmn Jul 12, 2017
1e69fc7
FLUME-2620. File Channel to support empty values in headers
marcellhegedus Jul 19, 2017
685b79a
FLUME-3135. Remove the references to StaticInterceptor from RegexFilt…
Jul 24, 2017
1 change: 1 addition & 0 deletions .gitignore
@@ -17,3 +17,4 @@ derby.log
.idea
*.iml
nb-configuration.xml
.DS_Store
26 changes: 26 additions & 0 deletions .travis.yml
@@ -0,0 +1,26 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

language: java
jdk:
- oraclejdk8
before_install:
- export MAVEN_SKIP_RC="true" # Travis has settings in /etc/mavenrc. We want to override them. See https://github.com/travis-ci/travis-ci/issues/4613
install:
- # Skip mvn install. See https://docs.travis-ci.com/user/languages/java/
script:
- MAVEN_OPTS="-Xms512m -Xmx1024m" mvn clean install -DskipTests
534 changes: 421 additions & 113 deletions CHANGELOG

Large diffs are not rendered by default.

318 changes: 318 additions & 0 deletions CONTRIBUTING.md
@@ -0,0 +1,318 @@
<!---
Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
-->
Apache Flume: How to Contribute
===============================

Welcome contributors! We strive to include everyone's contributions.
This page provides necessary guidelines on how to contribute effectively
towards furthering the development and evolution of Flume. You should
also read the guide on setting up the development environment, where you will
find details on how to check out, build, and test Flume.

**Note**: This guide applies to general contributors. If you are a
committer, please read the How to Commit guide as well.


What can be contributed?
------------------------

There are many ways you can contribute towards the project. A few of
these are:

**Jump in on discussions**: It is possible that someone initiates a
thread on the mailing list describing a problem that you have dealt with
in the past. You can help the project by chiming in on that thread and
guiding that user to overcome or work around that problem or limitation.

**File Bugs**: If you notice a problem and are sure it is a bug, then go
ahead and [file a JIRA](https://issues.apache.org/jira/browse/Flume). If,
however, you are not sure that it is a bug, you should first
confirm it by discussing it on the [Mailing
Lists](https://cwiki.apache.org/confluence/display/FLUME/Mailing+Lists).

**Review Code**: If you see that a JIRA ticket has a "Patch Available"
status, go ahead and review it. It cannot be stressed enough that **you
must be kind in your review** and explain the rationale for your
feedback and suggestions. Also note that not all review feedback is
accepted; often the outcome is a compromise between the contributor and
reviewer. If you are happy with the change and do not spot any major
issues, then `+1` it. More information on this is available in the
following sections.

**Provide Patches**: We encourage you to assign the relevant JIRA issue
to yourself and supply a patch for it. The patch you provide can be
**code**, **documentation**, **build changes**, or any combination of
these. More information on this is available in the following sections.

Providing Patches
-----------------

In order to provide patches, follow these guidelines:

- **Make sure there is a JIRA**:
1. If you are working on fixing a problem that already has an
associated JIRA, then go ahead and assign it to yourself.
2. If it is already assigned to someone else, check with the
current assignee before moving it over to your queue.
3. If the current assignee has already worked out some part of the
fix, suggest that you can take that change over from them and
complete the remaining parts.



- **Attach the patches as you go through development**:
- While small fixes are easily done in a single patch, it is
preferable that you attach patches to the JIRA as you go along.
This serves as an early feedback mechanism where interested
folks can look it over and suggest changes where necessary. It
also ensures that if for some reason you are not able to find
the time to complete the change, someone else can take up your
initial patches and drive them to completion.

- **Before you submit your patch**:
1. Your change should be well-formatted and readable. Please use
two spaces for indentation (no tabs).
2. Carefully consider whether you have handled all boundary
conditions and have provided sufficiently defensive code
where necessary.
3. Add one or more unit tests, if your change is not covered by
existing automated tests.
4. Insert javadocs and code comments where appropriate.
5. Update the [Flume User
Guide](http://flume.apache.org/FlumeUserGuide.html)
([source](https://git-wip-us.apache.org/repos/asf?p=flume.git;a=blob;f=flume-ng-doc/sphinx/FlumeUserGuide.rst;hb=trunk))
if your change affects the Flume config file or any
user interface. Include those changes in your patch.
6. Make sure you update the relevant developer documentation, wiki
pages, etc. if your change affects the development
environment.

- **Test your changes before submitting a review**:
- Before you mark the JIRA as "Patch Available", please
test your changes thoroughly. Try any new feature or fix out for
yourself, and make sure that it works.
- Make sure that all unit/integration tests are passing, and that
the functionality you have worked on is tested through existing
or new tests.
- You can run all the tests by going to the root level of the
source tree and typing `mvn clean install`.

- **How to create a patch file**:
- The preferred naming convention for Flume patches is
`FLUME-12345.patch`, or `FLUME-12345-0.patch` where `12345` is
the JIRA number. You might want to name successive versions of
the patch something like `FLUME-12345-1.patch`,
`FLUME-12345-2.patch`, etc. as you iterate on your changes based
on review feedback and re-submit them.
- The command to generate the patch is `git diff`. Example:

```
$ git diff > /path/to/FLUME-1234-0.patch
```


- **How to apply someone else's patch file**:
- You can apply someone else's patch with the GNU `patch` tool.
Example:

```
$ cd ~/src/flume # or wherever you keep the root of your Flume source tree
$ patch -p1 < FLUME-1234.patch
```

- Contributors may submit patches in either of two formats. If
the `patch` tool prompts you for the file to patch, try the
`-p1` flag and then the `-p0` flag. Without any additional arguments,
`git diff` generates patches that are applied using `patch -p1`.
If you use `git diff --no-prefix` to generate your patch, you
have to apply it using `patch -p0`. The ReviewBoard tool
understands both formats and is able to apply both
types automatically.


- **Submitting your patch for review:**
1. To submit a patch, attach the patch file to the JIRA and change
the status of the JIRA to "Patch Available".
2. If the change is non-trivial, please also post it for review on
the [Review
Board](https://reviews.apache.org/groups/Flume/).
Use the Repository "flume-git" on Review Board.
3. Link the JIRA to the Review Board review. JIRA has a feature you
can use for this by going to More Actions > Link > Web
Link when logged into JIRA.



- **Identify a reviewer:**
1. When posting on [review
board](https://reviews.apache.org/groups/Flume/)
(repository: "flume-git"), always add the Group "Flume" to the
list of reviewers.
2. Optionally, you may also add a specific reviewer to the review.
You can pick any of the project committers for review. Note that
identifying a reviewer does not stop others from reviewing
your change. Be prepared for having your change reviewed by
others at any time.
3. If you have posted your change for review and no one has had a
chance to review it yet, you can gently remind everyone by
dropping a note on the developer mailing list with a link to
the review.



- **Work with reviewers to get your change fleshed out**:
1. When your change is reviewed, please engage with the reviewer
via JIRA or review board to get necessary clarifications and
work out other details.
2. The goal is to ensure that the final state of your change is
acceptable to the reviewer so that they can +1 it.
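
The two patch formats described under "How to create a patch file" and
"How to apply someone else's patch file" can be tried out safely in a
scratch repository. The following is a minimal sketch, not part of the Flume
tooling: the temporary repository, the file `file.txt`, the `demo` commit
identity, and the JIRA number `12345` are all placeholders chosen for
illustration. It assumes `git` and `patch` are on the `PATH`.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch repository standing in for a Flume checkout.
work=$(mktemp -d)
cd "$work"
git init -q repo
cd repo
printf 'one\n' > file.txt
git add file.txt
git -c user.name=demo -c user.email=demo@example.invalid commit -q -m 'initial'

# Make a change and capture it in both formats, using the
# FLUME-<jira>-<n>.patch naming convention (12345 is a placeholder).
printf 'one\ntwo\n' > file.txt
git diff > "$work/FLUME-12345-0.patch"              # paths carry a/ and b/ prefixes
git diff --no-prefix > "$work/FLUME-12345-1.patch"  # paths carry no prefix

# Discard the change, then apply the prefixed patch with -p1.
git checkout -q -- file.txt
patch -p1 < "$work/FLUME-12345-0.patch" > /dev/null
p1_result=$(cat file.txt)

# Discard the change again, then apply the unprefixed patch with -p0.
git checkout -q -- file.txt
patch -p0 < "$work/FLUME-12345-1.patch" > /dev/null
p0_result=$(cat file.txt)
```

Both applications should leave `file.txt` with the same two lines. The only
difference between the patch files is whether the paths in the `---`/`+++`
headers start with `a/` and `b/`, which is what the `-p` strip count accounts
for.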

Reviewing Code
--------------

Flume uses the [Apache Review
Board](https://reviews.apache.org/groups/Flume) for
doing code reviews. In order for a change to be reviewed, it should be
either posted on the review board or attached to the JIRA. If the change
is a minor change affecting only a few lines and does not seem to impact
the main logic of the affected sources, it need not be posted on the review
board. However, if the code change is large or otherwise impacts the
core logic of the affected sources, it should be posted on the review
board. Feel free to comment on the JIRA requesting the assignee to post
the patch for review on review board.

**Note**: Not all patches attached to a JIRA are ready for review.
Sometimes the patches are attached just to solicit early feedback
regarding the implementation direction. Feel free to look them over and
give your feedback in the JIRA as necessary. Patches are considered
ready for review either when the patch has been posted on review board,
or the JIRA status has been changed to 'Patch Available'. Here is a
[list of Flume JIRAs marked Patch
Available](https://issues.apache.org/jira/issues/?jql=project%20%3D%20flume%20AND%20status%20%3D%20%22Patch%20Available%22).
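The link above embeds a JQL query in percent-encoded form. As a minimal sketch, the same URL can be rebuilt in Java, which makes it easy to adapt the query, for example to a different status. The class and method names are hypothetical and not part of Flume; only the base URL and the query text come from the link above. The sketch assumes Java 10 or later for `URLEncoder.encode(String, Charset)`.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Flume: builds a JIRA issue-search URL
// from a JQL query string.
final class PatchAvailableLink {
  private static final String SEARCH_URL =
      "https://issues.apache.org/jira/issues/?jql=";

  static String forJql(String jql) {
    // URLEncoder produces form encoding, where a space becomes '+'.
    // A literal '+' in the input is already encoded as %2B, so replacing
    // the remaining '+' characters with %20 is safe.
    String encoded =
        URLEncoder.encode(jql, StandardCharsets.UTF_8).replace("+", "%20");
    return SEARCH_URL + encoded;
  }

  public static void main(String[] args) {
    // Same query as the "Patch Available" link above.
    System.out.println(
        forJql("project = flume AND status = \"Patch Available\""));
  }
}
```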

### Goals for Code Reviews

Regardless of how a review is conducted, its net outcome should be the
same. It should ensure the following:

- Bugs/Omissions/Regressions are caught before the change is committed
to source control.
- The change keeps the quality of the code high so that the overall
system stays sustainable. The implementation of the
change should be easily readable, documented where necessary, and
must favor simplicity of implementation.
- Changes are evaluated from the perspective of a consumer
(the reviewer) as opposed to the developer, which often brings out
subtleties in the implementation that otherwise go unnoticed.
- The change should be backward compatible and not require extensive
work on existing installations in order for it to be consumed. There
are exceptions to this in some cases like when work is done on a
major release, but otherwise backward compatibility should be upheld
at all times. If this is unclear, raise it as a concern to be
clarified during the review.

### Code review guidelines

The following are some guidelines on how to do a code review. You may use
any other approach instead as long as the above stated goals are met.
That said, here is an approach that works fine generally:

- **Understand the problem being solved**: This often requires going
through the JIRA comments and/or mailing list threads where the
discussion around the problem has happened in the past. Look for key
aspects of the problem such as how it has impacted the users and
what, if any, is the suggested way to solve it. You may not find
enough information regarding the problem in some cases, in which
case feel free to ask for clarification from the developer
contributing the change.



- **Think about how you would solve the problem**: There are many ways
to solve any coding problem, each with
different merits. Before proceeding to review the change, think
through how you would solve the problem if you were the one
implementing the solution. Note the various aspects of the problem
that your solution would have to address. Some aspects to think about
are the impact on backward compatibility, the overall usability of the
system, and any impact on performance.



- **Evaluate the proposed change in contrast to your solution**:
Unless the change is obvious, it is likely that the implementation
of the change you are reviewing is very different from the solution
you would go for. Evaluate this change on the various aspects that
you evaluated your solution on in the previous step. See how it
measures up and give feedback where you think it could be improved.



- **Look for typical pitfalls**: Read through the implementation to
see whether it needs documentation in places where the intention is
not clear, whether all the boundary conditions are addressed, whether
the code is defensive enough, and whether any bad idioms have leaked in,
such as double-checked locking. In short, check for things that a
developer is likely to miss in their own code but that are
obvious to someone trying to read and understand the code.



- **See if the change is complete**: Check if the change is such that
it affects the user interface. If it does, then the documentation
should likely be updated. What about testing: does it have enough
test coverage? What about other aspects like license headers and
copyright statements? How about checkstyle and findbugs: did
they generate new warnings? How about compiler warnings?



- **Test the change**: It is very easy to test the change if you have
the development environment set up. Run as many tests as
you want with the patch. Manually test the change for functionality
that you think is not fully covered via the associated tests. If you
find a problem, report it.
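As an illustration of the kind of idiom named under typical pitfalls, here is a hedged sketch of double-checked locking next to one common safe alternative. Both classes are hypothetical examples, not taken from the Flume code base. The first version is unsafe because the field is not `volatile`, so under the Java memory model another thread may observe a non-null reference to an object whose fields are not yet initialized.

```java
// Hypothetical example classes; neither exists in Flume.

/** Unsafe: classic double-checked locking without 'volatile'. */
final class BrokenLazyConfig {
  private static BrokenLazyConfig instance;  // missing 'volatile'
  private int batchSize;                      // non-final on purpose

  private BrokenLazyConfig() {
    batchSize = 100;
  }

  static BrokenLazyConfig get() {
    if (instance == null) {                   // unsynchronized read
      synchronized (BrokenLazyConfig.class) {
        if (instance == null) {
          // The reference may become visible to other threads before
          // the write to batchSize does.
          instance = new BrokenLazyConfig();
        }
      }
    }
    return instance;
  }

  int batchSize() {
    return batchSize;
  }
}

/** Safe alternative: the initialization-on-demand holder idiom. */
final class SafeLazyConfig {
  private final int batchSize = 100;

  private SafeLazyConfig() { }

  // The JVM initializes Holder lazily and thread-safely on first access.
  private static final class Holder {
    static final SafeLazyConfig INSTANCE = new SafeLazyConfig();
  }

  static SafeLazyConfig get() {
    return Holder.INSTANCE;
  }

  int batchSize() {
    return batchSize;
  }

  public static void main(String[] args) {
    // Every call returns the same fully constructed instance.
    System.out.println(SafeLazyConfig.get() == SafeLazyConfig.get());
  }
}
```

Declaring the field `volatile` would also repair the first version; the holder idiom is shown because it needs no explicit locking at all.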

### How to give feedback

Once you have collected your comments/concerns/feedback, you need to send
it back to the contributor. In doing so, please be as courteous as
possible and ensure the following:

- Your feedback should be clear and actionable. Giving
subjective/vague feedback does not add any value or facilitate a
constructive dialog.
- Where possible, suggest how your concern can be addressed. For
example, if your testing revealed that a certain use-case is not
satisfied, it is acceptable to state that as is, but it would be
even better if you could suggest how the developer can address it.
Present your suggestion as a possible solution rather than
*the* solution.
- If you do not understand part of the change, or for some reason were
not able to review part of the change, state it explicitly so as to
encourage other reviewers to jump in and help.

Once you have provided your feedback, wait for the developer to respond.
The developer may need further clarification on your
feedback, in which case you should provide it promptly.
In general, the dialog between the reviewer and developer should lead to
finding a reasonable middle ground where key concerns are satisfied and
the goals of the review have been met.

If a change has met all your criteria for review, please `+1` the change
to indicate that you are happy with it.
12 changes: 1 addition & 11 deletions DEVNOTES
@@ -41,17 +41,7 @@ to most devs are "compile" -> "test" -> "package" -> "install".

Set MAVEN_OPTS to give the Flume build enough RAM to build.

export MAVEN_OPTS="-Xmx512M -XX:MaxPermSize=512M"

Note: If you see a permgen error (below), you need to increase the perm gen size.

[ERROR] PermGen space -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please read the following articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/OutOfMemoryError
export MAVEN_OPTS="-Xms512m -Xmx1024m"

Builds
------