Release Notes - ASF JIRA

Release Notes - Beam - Version 2.0.0 - HTML format

Configure Release Notes

Sub-task

[BEAM-772] - Implement Metrics support for Dataflow Runner
[BEAM-773] - Implement Metrics support for Flink runner
[BEAM-775] - Remove Aggregators from the Java SDK
[BEAM-827] - Remove PipelineOptions from construction time in WriteFiles
[BEAM-1617] - Add Gauge metric type to Java SDK
[BEAM-1651] - Add code style xml to the project repository
[BEAM-1684] - Add unit tests for iobase.py
[BEAM-1722] - Move PubsubIO out of the core SDK
[BEAM-1726] - Verify PAssert execution in TestFlinkRunner
[BEAM-1763] - TestPipeline should ensure that all assertions succeeded
[BEAM-1912] - Move HashingFn into io/common so it can be used by other tests
[BEAM-1958] - Standard IO Metrics in Java SDK
[BEAM-2002] - Verify PAssert execution in TestSparkRunner
[BEAM-2003] - Verify PAssert execution in TestDataflowRunner
[BEAM-2030] - Implement beam FileSystem's copy()
[BEAM-2031] - Hadoop FileSystem needs to receive Hadoop Configuration
[BEAM-2032] - Implement delete
[BEAM-2033] - Implement ResourceIds for HadoopFileSystem
[BEAM-2070] - Implement match for HadoopFileSystem
[BEAM-2329] - ABS Function
[BEAM-2330] - MOD Function
[BEAM-2331] - SQRT Function

Bug

[BEAM-145] - OutputTimeFn#assignOutputTime overrides WindowFn#getOutputTime in unfortunate ways
[BEAM-260] - WindowMappingFn: Know the getSideInputWindow upper bound to release side input resources
[BEAM-437] - Data-dependent BigQueryIO in batch
[BEAM-463] - BoundedHeapCoder should be a StandardCoder and not a CustomCoder
[BEAM-539] - Error when writing to the root of a GCS location
[BEAM-632] - Dataflow runner does not correctly flatten duplicate inputs
[BEAM-655] - Rename @RunnableOnService to something more descriptive
[BEAM-662] - SlidingWindows should support sub-second periods
[BEAM-828] - Remove PipelineOptions from construction time in BigQueryIO
[BEAM-1013] - Recheck all existing programming guide code snippets for correctness
[BEAM-1022] - WindowNamespace and WindowAndTriggerNamespace should not use Java object equality when comparing windows
[BEAM-1040] - Hadoop InputFormat - IO Transform for reads
[BEAM-1048] - Spark Runner streaming batch duration does not include duration of reading from source
[BEAM-1053] - ApexGroupByKeyOperator serialization issues
[BEAM-1068] - Service Account Credentials File Specified via Pipeline Option Ignored
[BEAM-1101] - Remove inconsistencies in Python PipelineOptions
[BEAM-1213] - WordCount example failure on Apex Runner
[BEAM-1247] - Session state should not be lost when discardingFiredPanes
[BEAM-1264] - Python ChannelFactory Raise Inconsistent Error for Local FS and GCS
[BEAM-1283] - DoFn finishBundle should be required to specify the window for output
[BEAM-1316] - DoFn#startBundle should not be able to output
[BEAM-1355] - HDFS IO should comply with PTransform style guide
[BEAM-1362] - Update the beam release process to include python sdk
[BEAM-1366] - Add metrics checks to Python SDK once metrics have been implemented
[BEAM-1381] - Implement DataflowMetrics.query method
[BEAM-1383] - Consistency in the Metrics examples
[BEAM-1402] - Make TextIO and AvroIO use best-practice types.
[BEAM-1414] - CountingInput should comply with PTransform style guide
[BEAM-1415] - PubsubIO should comply with PTransform style guide
[BEAM-1418] - MapElements and FlatMapElements should comply with PTransform style guide
[BEAM-1422] - ParDo should comply with PTransform style guide
[BEAM-1425] - Window should comply with PTransform style guide
[BEAM-1428] - KinesisIO should comply with PTransform style guide
[BEAM-1459] - Dataflow runner has deprecated metricsUpdates in favor of counterUpdates. Add setters.
[BEAM-1508] - PInput, POutput#expand should not be ordered
[BEAM-1546] - Specify exact version for Python in the SDK
[BEAM-1568] - Ineffective null check in IsmFormat#structuralValue
[BEAM-1569] - HDFSFileSource: Unable to read from filePattern with spaces in path
[BEAM-1571] - Flatten on a single input PCollection should have a test associated with it
[BEAM-1572] - Add per-stage matching of scope in metrics for the DirectRunner
[BEAM-1575] - Add ValidatesRunner test to PipelineTest.test_metrics_in_source
[BEAM-1578] - Runners should put PT overrides into a list rather than map
[BEAM-1579] - Runners should verify that PT overrides converged
[BEAM-1580] - Typo in bigquery_tornadoes example
[BEAM-1594] - Treat JOB_STATE_DRAINED as terminal in DataflowRunner
[BEAM-1629] - Metrics/aggregators accumulators should be instantiated before traversing pipeline
[BEAM-1635] - TypeError in AfterWatermark class's __repr__ method
[BEAM-1642] - Combine transformation evaluation fails on direct runner with Avro as a fallback coder
[BEAM-1644] - IO ITs: shared directory for kubernetes resources and PipelineOptions?
[BEAM-1645] - Display data not populated on Window.Assign
[BEAM-1649] - Fix unresolved references in Python SDK
[BEAM-1653] - Error when using PubsubIO with the DirectRunner
[BEAM-1656] - DirectRunner should not call finalize twice in UnboundedSourceExecutorFactory
[BEAM-1657] - DirectRunner should not call close twice in UnboundedSourceExecutorFactory
[BEAM-1671] - Support bypassing `validate` flag when using tfrecordio
[BEAM-1673] - PubSubIO can't write attributes
[BEAM-1676] - SdkCoreApiSurfaceTest Failed When Directory Contains Space
[BEAM-1686] - MQTT IO throws exception when client id is not specified
[BEAM-1690] - BigQueryTornadoesIT failing
[BEAM-1694] - Fix docstring inaccuracies in Python-SDK
[BEAM-1695] - Improve Python-SDK's programming guide
[BEAM-1709] - Implement Single-output ParDo as Multi-output ParDo
[BEAM-1711] - Document extra features on quick start guide
[BEAM-1713] - SparkRuntimeContext instances are leaking via StateSpecFunctions#mapSourceFunction
[BEAM-1718] - Returning Duration.millis(Long.MAX_VALUE) in DoFn.getAllowedTimestampSkew() causes Overflow/Underflow
[BEAM-1719] - Test modules are included in generated documentation
[BEAM-1721] - Reshuffle can shift elements in time
[BEAM-1723] - FlinkRunner should deduplicate when an UnboundedSource requires Deduping
[BEAM-1732] - Window.Assign does not properly populate DisplayData of the enclosing Window transform
[BEAM-1737] - Implement a Single-output ParDo as a Multi-output ParDo with a single output
[BEAM-1741] - Update runner pages for Python
[BEAM-1742] - UnboundedSource CheckpointMark should have more precise documentation
[BEAM-1751] - Singleton ByteKeyRange with BigtableIO and Dataflow runner
[BEAM-1762] - Python SDK Error Message no python 3 compatible
[BEAM-1767] - Remove Aggregators from Dataflow runner
[BEAM-1768] - assert_that always passes for empty inputs
[BEAM-1769] - Travis - python only executes py27 tox environment
[BEAM-1770] - DoFn javadoc claims no runner supports state or timers
[BEAM-1772] - Support merging WindowFn other than IntervalWindow on Flink Runner
[BEAM-1776] - Timers should be delivered in the window they were set in
[BEAM-1777] - If PipelineEnforcement throws an exception after Pipeline.run() fails, it overwrites the original failure
[BEAM-1780] - BigtableReader.splitIntoFraction should more carefully guard input
[BEAM-1784] - DataflowPipelineJob.cancel() should be idempotent
[BEAM-1792] - Spark runner uses its own filtering logic to match metrics
[BEAM-1793] - Frequent python post commit errors
[BEAM-1795] - Upgrade google-cloud-bigquery to 0.23.0
[BEAM-1801] - default_job_name can generate names not accepted by DataFlow
[BEAM-1802] - Spark Runner does not shutdown correctly when executing multiple pipelines in sequence
[BEAM-1803] - Metrics filters have a missmatch in class-based namespace
[BEAM-1810] - Spark runner combineGlobally uses Kryo serialization
[BEAM-1815] - Avoid shuffling twice in GABW
[BEAM-1818] - Expose side-channel inputs in PTransform
[BEAM-1828] - GlobalWatermarkHolder uses unpersist instead of destory
[BEAM-1832] - Potentially unclosed OutputStream in ApexYarnLauncher
[BEAM-1835] - NPE in DirectRunner PubsubReader.ackBatch
[BEAM-1837] - NPE in KafkaIO writer
[BEAM-1838] - GlobalWindow equals() and hashCode() doesn't work with other serialization frameworks
[BEAM-1842] - Stop matching composite PCollectionView PTransforms
[BEAM-1844] - test_memory_usage fails in post commit
[BEAM-1849] - Output from OnTimer method has windows re-assigned
[BEAM-1856] - HDFSFileSink class do not use the same configuration in master and slave
[BEAM-1862] - SplittableDoFnOperator should close the ScheduledExecutorService
[BEAM-1865] - Input Coder of GroupByKey should be a KV Coder in the Python SDK
[BEAM-1867] - Element counts missing on Cloud Dataflow when PCollection has anything other than hardcoded name pattern
[BEAM-1869] - getProducingTransformInternal should not be available on any PValue
[BEAM-1873] - Javadoc in BigQueryIO doesn't reflect recent changes
[BEAM-1886] - Remove TextIO override in Flink runner
[BEAM-1902] - Datastore IO never retries on errors
[BEAM-1903] - Splittable DoFn should report watermarks via ProcessContext
[BEAM-1904] - Remove DoFn.ProcessContinuation
[BEAM-1913] - TFRecordIO should comply with PTransform style guide
[BEAM-1914] - XML IO should comply with PTransform style guide
[BEAM-1922] - DataSource in JdbcIO is not closed
[BEAM-1926] - Need 3 Python snippets for composite transforms section in programming guide
[BEAM-1935] - DirectRunner Cancel should never throw a RejectedExecutionException
[BEAM-1937] - PipelineSurgery renumbers already-unique transforms
[BEAM-1947] - DisplayData raises exception when passed unicode string
[BEAM-1954] - "test" extra need nose in the requirements list
[BEAM-1963] - Quick start on home page redirects to java quickstart
[BEAM-1964] - Upgrade pylint to 1.7.0
[BEAM-1966] - ApexRunner in cluster mode does not register standard FileSystems/IOChannelFactories
[BEAM-1969] - GCP extras should not required fix version of proto-google-cloud-datastore-v1
[BEAM-1970] - Cannot run UserScore on Flink runner due to AvroCoder classload issues
[BEAM-1972] - HIFIO jdk module fails enforcer when only java 7 is installed on machine
[BEAM-1977] - PubsubIO fails with NPE on ACK when running locally
[BEAM-1981] - Serialization error with TimerInternals in ApexGroupByKeyOperator
[BEAM-1988] - utils.path.join does not correctly handle GCS bucket roots
[BEAM-1989] - clean SyntaxWarning
[BEAM-1992] - Count.perElement javadoc refers to Count.PerElement, but Count.PerElement is private
[BEAM-1998] - Update json_values_test.py for ValueProvider
[BEAM-2017] - DataflowRunner: fix NullPointerException that can occur when no metrics are present
[BEAM-2019] - Count.globally() requires default values for non-GlobalWindows
[BEAM-2022] - ApexTimerInternals seems to treat processing time timers as event time timers
[BEAM-2023] - BigQueryIO.Write needs a way of dynamically specifying table schemas
[BEAM-2029] - NullPointerException when using multi output ParDo in Spark runner in streaming mode.
[BEAM-2040] - Occasional build failures caused by AutoValue
[BEAM-2052] - Windowed file sinks should support dynamic sharding
[BEAM-2071] - AttributeError in dataflow_metrics
[BEAM-2072] - MicrobatchSource.reader stops reading after reaching maxNumRecords for the first time
[BEAM-2073] - Change SourceDStream.rateControlledMaxRecords() to better reflect its intention
[BEAM-2074] - SourceDStream's rate control mechanism may not work
[BEAM-2077] - Remove AvroCoder#createDatum(Reader/Writer)
[BEAM-2084] - Distribution metrics should be queriable in the Dataflow Runner
[BEAM-2086] - TestDataflowRunner relies on metrics which are not present in streaming jobs
[BEAM-2091] - Typo in build instructions in Apex Runner's README.md
[BEAM-2092] - MicrobatchSource can be relieved of some of its methods since it's never used as an actual BoundedSource
[BEAM-2093] - Update Jackson version to 2.8.8 in archetype (or align with parent pom)
[BEAM-2094] - WordCount examples produce garbage for non-English input text
[BEAM-2095] - The hasNext method of the iterator returned by SourceRDD#compute is not idempotent
[BEAM-2096] - NullPointerException in DataflowMetrics
[BEAM-2098] - Walkthrough URL in example code Javadoc is 404 not found
[BEAM-2105] - Audit that user-facing stuff is in main jars, not the test suite jars
[BEAM-2106] - NotSerializableException thrown when serializing EvaluationContext
[BEAM-2113] - Apex Runner is not able to submit any job to YARN
[BEAM-2114] - KafkaIO broken with CoderException
[BEAM-2116] - PubsubJsonClient doesn't write user created attributeMap
[BEAM-2119] - FileSystems doesn't install the local filesystem on intialization by default
[BEAM-2120] - DataflowPipelineJob processes all log messages with each waitUntilFinish
[BEAM-2122] - Writing to partitioned BigQuery tables from Dataflow is causing errors
[BEAM-2130] - Ensure options id is never null
[BEAM-2136] - AvroCoderTest.testTwoClassLoaders fails on beam_PostCommit_Java_ValidatesRunner_Dataflow
[BEAM-2143] - (Mis)Running Dataflow Wordcount gives non-helpful errors
[BEAM-2152] - Authentication fails if there is an unauthenticated gcloud tool even if application default credentials are available
[BEAM-2154] - Writing to large numbers of BigQuery tables causes out-of-memory
[BEAM-2157] - HadoopFileSystemModuleTest Failed in Some JDK Versions on Jenkins
[BEAM-2162] - Add logging during and after long running BigQuery jobs
[BEAM-2170] - PubsubIO.readStrings should handle messages without metadata
[BEAM-2181] - Upgrade Bigtable dependency to 0.9.6.2
[BEAM-2183] - Maven-archetypes should depend on all Beam modules that their sources compile against
[BEAM-2184] - OutputTimeFn is not a Fn: in Python, rename to TimestampCombiner
[BEAM-2187] - SparkRuntimeContextTest fails to compile
[BEAM-2190] - User depending on IO-GCP still gets a dependency on protobuf-lite
[BEAM-2205] - AttributeError when running datastore wordcount
[BEAM-2210] - PubsubIO.readPubsubMessagesWithoutAttributes is awkward
[BEAM-2211] - DataflowRunner (Java) rejects all but GCS paths for FileBasedSource/Sink
[BEAM-2212] - ValueProvider-ification of core transforms makes logs and errors worse
[BEAM-2213] - Java DirectRunner takes 60s to shut down after wordcount runs
[BEAM-2222] - Clean up readme files
[BEAM-2223] - java8 examples are not running
[BEAM-2224] - maptask_executor_runner_test fails in windows
[BEAM-2229] - GcsFileSystem attempts to create invalid Metadata
[BEAM-2233] - Java 8 examples should separate runners into distinct profiles, like Java 7 examples
[BEAM-2236] - Move test utilities out of python core
[BEAM-2239] - Step context not always available when exceptions raised.
[BEAM-2240] - Step context not always available when exceptions raised.
[BEAM-2242] - Apache Beam Java modules do not correctly shade test artifacts
[BEAM-2243] - org.apache.beam.GcpCoreApiSurfaceTest.testApiSurface fails at release-2.0.0 head
[BEAM-2244] - Move runner-facing Metrics classes to runners core
[BEAM-2249] - AvroIO does not handle partial reads
[BEAM-2256] - mongodb sdk MongoDbIO.BoundedMongoDbSource.splitKeysToFilters incorrect
[BEAM-2259] - Reshuffle may set watermark holds past the end of time
[BEAM-2260] - When using WindowedWrites and default FilenamePolicy, TextIO should throw at construction time
[BEAM-2275] - SerializableCoder fails to serialize when used with a generic type token
[BEAM-2277] - IllegalArgumentException when using Hadoop file system for WordCount example.
[BEAM-2279] - Hadoop file system support should be included in examples/archetype profiles of Spark runner.
[BEAM-2305] - Dinstinct transform produces unexpected output when triggered
[BEAM-2326] - Verbose INFO logging with stateful DoFns and Dataflow
[BEAM-2429] - Conflicting filesystems with used of HadoopFileSystem

New Feature

[BEAM-59] - Switch from IOChannelFactory to FileSystems
[BEAM-73] - IO design pattern: Decouple Parsers and Coders
[BEAM-135] - Utilities for "batching" elements in a DoFn
[BEAM-147] - Introduce an easy API for pipeline metrics
[BEAM-404] - PubsubIO should have a mode that supports maintaining message attributes.
[BEAM-596] - Support cancel() and waitUntilFinish() in DirectRunner
[BEAM-638] - Add sink transform to write bounded data per window, pane, [and key] even when PCollection is unbounded
[BEAM-846] - Decouple side input window mapping from WindowFn
[BEAM-885] - Move PipelineOptions from Pipeline.create() to Pipeline.run()
[BEAM-1047] - DataflowRunner: support regionalization.
[BEAM-1076] - DatastoreIO template Options
[BEAM-1195] - Give triggers a cross-language serialization schema
[BEAM-1198] - ViewFn: explicitly decouple runner materialization of side inputs from SDK-specific mapping
[BEAM-1327] - Replace OutputTimeFn with enum
[BEAM-1328] - Serialize/deserialize WindowingStrategy in a language-agnostic manner
[BEAM-1397] - Introduce IO metrics
[BEAM-1398] - KafkaIO metrics
[BEAM-1441] - Add FileSystem support to Python SDK
[BEAM-1855] - Support Splittable DoFn in Flink Streaming runner
[BEAM-1960] - Hadoop InputFormat - Add Kubernetes large and small cluster Scripts for Cassandra and Elasticsearch tests
[BEAM-2005] - Add a Hadoop FileSystem implementation of Beam's FileSystem
[BEAM-2054] - Upgrade dataflow.version to v1b3-rev196-1.22.0
[BEAM-2147] - Re-enable UsesTimersInParDo tests for DataflowRunner

Improvement

[BEAM-447] - Stop referring to types with Bound/Unbound
[BEAM-649] - Smarter caching of RDDs
[BEAM-720] - Run WindowedWordCount Integration Test in Flink
[BEAM-806] - Maven Release Plugin Does Not Set Archetype Versions
[BEAM-818] - ValueProvider for tempLocation, runner, etc, that is unavailable to transforms during construction
[BEAM-831] - ParDo Chaining
[BEAM-848] - Shuffle input read-values to get maximum parallelism.
[BEAM-911] - Mark API of multiple IOs as @Experimental
[BEAM-1071] - Support pre-existing tables with streaming BigQueryIO
[BEAM-1074] - Set default-partitioner in SourceRDD.Unbounded.
[BEAM-1148] - Port PAssert away from Aggregators
[BEAM-1179] - Update assertions of source_test_utils from camelcase to underscore-separated
[BEAM-1182] - Direct runner should enforce encodability of unbounded source checkpoints
[BEAM-1199] - Condense recordAsOutput, finishSpecifyingOutput from POutput
[BEAM-1242] - convert older IO/Sources to use standard ReadTransform style
[BEAM-1269] - BigtableIO should make more efficient use of connections
[BEAM-1272] - Align the naming of "generateInitialSplits" and "splitIntoBundles" to better reflect their intention
[BEAM-1294] - Long running UnboundedSource Readers
[BEAM-1336] - A StateSpec that doesn't care about the key shouldn't be forced to declare it as type Object
[BEAM-1337] - Use our coder infrastructure for coders for state
[BEAM-1340] - Remove or make private public bits of the SDK that shouldn't be public
[BEAM-1345] - Mark @Experimental and @Internal where needed in user-facing bits of the codebase
[BEAM-1401] - Sinks in Beam should supported windowed unbounded PCollections
[BEAM-1447] - Autodetect streaming/not streaming in DataflowRunner
[BEAM-1491] - HadoopFileSystemOptions should be able to read the HADOOP_CONF_DIR(YARN_CONF_DIR) environment variable
[BEAM-1514] - change default timestamp in KafkaIO
[BEAM-1520] - Implement TFRecordIO (Reading/writing Tensorflow Standard format)
[BEAM-1530] - BigQueryIO should support value-dependent windows
[BEAM-1539] - Support unknown length iterables for IterableCoder in Python SDK
[BEAM-1562] - Use a "signal" to stop streaming tests as they finish.
[BEAM-1573] - KafkaIO does not allow using Kafka serializers and deserializers
[BEAM-1633] - Move .tox/ directory under target/ in Python SDK
[BEAM-1660] - withCoder() error in JdbcIO JavaDoc example
[BEAM-1661] - shade guava in beam-sdks-java-io-jdbc
[BEAM-1672] - Accumulable MetricsContainers.
[BEAM-1689] - Apply changes for Flink's StatefulDoFnRunner to the primary StatefulDoFnRunner
[BEAM-1693] - Detect supported Python & pip executables in Python-SDK
[BEAM-1704] - Create.TimestampedValues should take a TypeDescriptor as an alternative to explicitly specifying the Coder
[BEAM-1708] - Better error messages when GCP features are not installed
[BEAM-1727] - Add setForNowAlign(period, offset) to Timer
[BEAM-1740] - Update bigtable version to 0.9.5.1
[BEAM-1743] - View.AsSingleton should be implemented in terms of a Global Combine, not the reverse
[BEAM-1749] - Upgrade pep8 to pycodestyle
[BEAM-1786] - AutoService registration of coders, like we do with PipelineRunners
[BEAM-1794] - Bigtable: improve user agent
[BEAM-1799] - IO ITs: simplify data loading design pattern
[BEAM-1807] - IO ITs: shared language neutral directory for kubernetes resources
[BEAM-1812] - Allow configuring checkpoints in Flink Runner PipelineOptions
[BEAM-1827] - Fix use of deprecated Spark APIs in the runner.
[BEAM-1829] - MQTT message compression not working on Rapsberry Pi
[BEAM-1830] - add 'withTopic()' api to KafkaIO Reader
[BEAM-1839] - Optimize StatelessJavaSerializer
[BEAM-1851] - Sample.fixedSizedGlobally documentation should include single worker memory constraint
[BEAM-1858] - improve error message when Create.of() is called with an empty iterator
[BEAM-1863] - Allow users to override the base container image but still choose image type
[BEAM-1864] - Shorten combining state names: "CombiningValue" and "AccumulatorCombiningState" to Combining (as appropriate)
[BEAM-1870] - ByteKey / ByteKeyRangeTracker should not use ByteString on public API surface
[BEAM-1871] - Thin Java SDK Core
[BEAM-1875] - Remove Spark runner custom Hadoop and Avro IOs.
[BEAM-1876] - GroupIntoBatches may be able to use Combine.BinaryCombineLongFn
[BEAM-1877] - Use Iterables.isEmpty in GroupIntoBatches
[BEAM-1882] - Jdbc k8s scripts: switch pod -> replicaController
[BEAM-1895] - Create tranform in python sdk should be a custom source
[BEAM-1897] - Remove Sink
[BEAM-1907] - Delete PubsubBoundedReader
[BEAM-1908] - Allow setting CREATE_NEVER when using a tablespec in BigQueryIO
[BEAM-1921] - expose connectionProperties in JdbcIO
[BEAM-1923] - Improve python log messages for temporary BigQuery tables
[BEAM-1949] - Rename DoFn.Context#sideOutput to #output
[BEAM-1990] - Window.Assign should not be public since it is not meant to be used publicly
[BEAM-1991] - Update references to SumDoubleFn => Sum.ofDoubles
[BEAM-1993] - Remove special unbounded Flink source/sink
[BEAM-1994] - Remove Flink examples package
[BEAM-2013] - Upgrade to Jackson 2.8.8
[BEAM-2014] - Upgrade to Google Auth 0.6.1
[BEAM-2020] - Move CloudObject to Dataflow runner
[BEAM-2021] - Fix Java's Coder class hierarchy
[BEAM-2044] - Downgrade HBaseIO to use the stable HBase client version (1.2.x)
[BEAM-2047] - PubsubStreamingWrite should use the input coder by default
[BEAM-2049] - Remove KeyedCombineFn
[BEAM-2051] - Reduce scope of the PCollectionView interface
[BEAM-2060] - XmlIO use harcoded Charset
[BEAM-2062] - EventHandler jaxb unmarshaller should be optional
[BEAM-2067] - Add support for generic CoderProvider -> CoderFactory mapping with CoderRegistrar
[BEAM-2068] - Upgrade Google-Apitools to latest version
[BEAM-2075] - Update flink runner to use flink version 1.2.1
[BEAM-2076] - DirectRunner: minimal transitive API surface
[BEAM-2099] - Create a WordCount example that works with HDFS
[BEAM-2135] - Rename hdfs module to hadoop-file-system, rename gcp-core to google-cloud-platform-core
[BEAM-2144] - Do not publish javadoc for Java SDK's util directory
[BEAM-2165] - Support custom user Jackson modules for PipelineOptions
[BEAM-2166] - Remove Coder.Context from the public API
[BEAM-2174] - Allow coder factories to create Coders for a wider range of types
[BEAM-2206] - Move pipeline options into separate package from beam/utils
[BEAM-2218] - PubsubIO.readPubsubMessages function names are too long
[BEAM-2221] - Make KafkaIO coder specification less awkward
[BEAM-2241] - Correctly mark top level classes and functions as private
[BEAM-2245] - Remove user-facing Timer.cancel() until further notice
[BEAM-2250] - Remove FnHarness code from PyDocs
[BEAM-3770] - The problem of kafkaIO sdk for data latency

Test

[BEAM-1184] - Add integration tests for ElasticsearchIO
[BEAM-1622] - Java: Rename RunnableOnService to ValidatesRunner
[BEAM-1752] - Tag Spark runner tests that recover from checkpoint.
[BEAM-2057] - Test metrics are reported to Spark Metrics sink.
[BEAM-2368] - one throw "Unable to find registrar for hdfs" with same code
[BEAM-3383] - Create validates runner metrics tests

Wish

[BEAM-378] - Integrate Python SDK in the Maven build
[BEAM-797] - A PipelineVisitor that creates a Spark-native pipeline.
[BEAM-1648] - Replace gsutil calls with Cloud Storage API

Task

[BEAM-825] - Fill in the documentation/runners/apex portion of the website
[BEAM-1027] - Hosting data stores to enable IO Transform testing
[BEAM-1353] - Beam should comply with PTransform style guide
[BEAM-1764] - Remove Aggregators from Flink Runner
[BEAM-1765] - Remove Aggregators from Spark runner
[BEAM-1766] - Remove Aggregators from Apex runner
[BEAM-1797] - add CoGroupByKey to chapter 'Using GroupByKey'
[BEAM-1887] - Switch ParDo execution to use new DoFn in Apex runner
[BEAM-1915] - Remove OldDoFn dependency in ApexGroupByKeyOperator
[BEAM-2016] - Delete HDFSFileSource/Sink
[BEAM-2124] - Deprecate <pipeline>.options usage
[BEAM-2139] - Disable SplittableDoFn ValidatesRunner tests for Streaming Flink Runner
[BEAM-2180] - Upgrade Apex dependency to 3.6.0
[BEAM-2235] - Restore wordcount example to its previous state

Edit/Copy Release Notes

The text area below allows the project release notes to be edited and copied to another document.

Release Notes - Beam - Version 2.0.0
    
<h2>        Sub-task
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-772'>BEAM-772</a>] -         Implement Metrics support for Dataflow Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-773'>BEAM-773</a>] -         Implement Metrics support for Flink runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-775'>BEAM-775</a>] -         Remove Aggregators from the Java SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-827'>BEAM-827</a>] -         Remove PipelineOptions from construction time in WriteFiles
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1617'>BEAM-1617</a>] -         Add Gauge metric type to Java SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1651'>BEAM-1651</a>] -         Add code style xml to the project repository
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1684'>BEAM-1684</a>] -         Add unit tests for iobase.py
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1722'>BEAM-1722</a>] -         Move PubsubIO out of the core SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1726'>BEAM-1726</a>] -         Verify PAssert execution in TestFlinkRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1763'>BEAM-1763</a>] -         TestPipeline should ensure that all assertions succeeded
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1912'>BEAM-1912</a>] -         Move HashingFn into io/common so it can be used by other tests
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1958'>BEAM-1958</a>] -         Standard IO Metrics in Java SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2002'>BEAM-2002</a>] -         Verify PAssert execution in TestSparkRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2003'>BEAM-2003</a>] -         Verify PAssert execution in TestDataflowRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2030'>BEAM-2030</a>] -         Implement beam FileSystem&#39;s copy()
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2031'>BEAM-2031</a>] -         Hadoop FileSystem needs to receive Hadoop Configuration
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2032'>BEAM-2032</a>] -         Implement delete
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2033'>BEAM-2033</a>] -         Implement ResourceIds for HadoopFileSystem
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2070'>BEAM-2070</a>] -         Implement match for HadoopFileSystem
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2329'>BEAM-2329</a>] -         ABS Function
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2330'>BEAM-2330</a>] -         MOD Function
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2331'>BEAM-2331</a>] -         SQRT Function
</li>
</ul>
            
<h2>        Bug
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-145'>BEAM-145</a>] -         OutputTimeFn#assignOutputTime overrides WindowFn#getOutputTime in unfortunate ways
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-260'>BEAM-260</a>] -         WindowMappingFn: Know the getSideInputWindow upper bound to release side input resources
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-437'>BEAM-437</a>] -         Data-dependent BigQueryIO in batch
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-463'>BEAM-463</a>] -         BoundedHeapCoder should be a StandardCoder and not a CustomCoder
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-539'>BEAM-539</a>] -         Error when writing to the root of a GCS location
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-632'>BEAM-632</a>] -         Dataflow runner does not correctly flatten duplicate inputs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-655'>BEAM-655</a>] -         Rename @RunnableOnService to something more descriptive
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-662'>BEAM-662</a>] -         SlidingWindows should support sub-second periods
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-828'>BEAM-828</a>] -         Remove PipelineOptions from construction time in BigQueryIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1013'>BEAM-1013</a>] -         Recheck all existing programming guide code snippets for correctness
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1022'>BEAM-1022</a>] -         WindowNamespace and WindowAndTriggerNamespace should not use Java object equality when comparing windows
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1040'>BEAM-1040</a>] -         Hadoop InputFormat - IO Transform for reads
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1048'>BEAM-1048</a>] -         Spark Runner streaming batch duration does not include duration of reading from source 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1053'>BEAM-1053</a>] -         ApexGroupByKeyOperator serialization issues
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1068'>BEAM-1068</a>] -         Service Account Credentials File Specified via Pipeline Option Ignored
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1101'>BEAM-1101</a>] -         Remove inconsistencies in Python PipelineOptions
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1213'>BEAM-1213</a>] -         WordCount example failure on Apex Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1247'>BEAM-1247</a>] -         Session state should not be lost when discardingFiredPanes
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1264'>BEAM-1264</a>] -         Python ChannelFactory Raise Inconsistent Error for Local FS and GCS
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1283'>BEAM-1283</a>] -         DoFn finishBundle should be required to specify the window for output
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1316'>BEAM-1316</a>] -         DoFn#startBundle should not be able to output
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1355'>BEAM-1355</a>] -         HDFS IO should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1362'>BEAM-1362</a>] -         Update the beam release process to include python sdk
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1366'>BEAM-1366</a>] -         Add metrics checks to Python SDK once metrics have been implemented
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1381'>BEAM-1381</a>] -         Implement DataflowMetrics.query method
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1383'>BEAM-1383</a>] -         Consistency in the Metrics examples
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1402'>BEAM-1402</a>] -         Make TextIO and AvroIO use best-practice types.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1414'>BEAM-1414</a>] -         CountingInput should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1415'>BEAM-1415</a>] -         PubsubIO should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1418'>BEAM-1418</a>] -         MapElements and FlatMapElements should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1422'>BEAM-1422</a>] -         ParDo should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1425'>BEAM-1425</a>] -         Window should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1428'>BEAM-1428</a>] -         KinesisIO should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1459'>BEAM-1459</a>] -         Dataflow runner has deprecated metricsUpdates in favor of counterUpdates. Add setters.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1508'>BEAM-1508</a>] -         PInput, POutput#expand should not be ordered
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1546'>BEAM-1546</a>] -         Specify exact version for Python in the SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1568'>BEAM-1568</a>] -         Ineffective null check in IsmFormat#structuralValue
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1569'>BEAM-1569</a>] -         HDFSFileSource: Unable to read from filePattern with spaces in path
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1571'>BEAM-1571</a>] -         Flatten on a single input PCollection should have a test associated with it
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1572'>BEAM-1572</a>] -         Add per-stage matching of scope in metrics for the DirectRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1575'>BEAM-1575</a>] -         Add ValidatesRunner test to PipelineTest.test_metrics_in_source
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1578'>BEAM-1578</a>] -         Runners should put PT overrides into a list rather than map
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1579'>BEAM-1579</a>] -         Runners should verify that PT overrides converged
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1580'>BEAM-1580</a>] -         Typo in bigquery_tornadoes example
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1594'>BEAM-1594</a>] -         Treat JOB_STATE_DRAINED as terminal in DataflowRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1629'>BEAM-1629</a>] -         Metrics/aggregators accumulators should be instantiated before traversing pipeline
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1635'>BEAM-1635</a>] -         TypeError in AfterWatermark class&#39;s __repr__ method
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1642'>BEAM-1642</a>] -         Combine transformation evaluation fails on direct runner with Avro as a fallback coder
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1644'>BEAM-1644</a>] -         IO ITs: shared directory for kubernetes resources and PipelineOptions?
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1645'>BEAM-1645</a>] -         Display data not populated on Window.Assign
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1649'>BEAM-1649</a>] -         Fix unresolved references in Python SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1653'>BEAM-1653</a>] -         Error when using PubsubIO with the DirectRunner 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1656'>BEAM-1656</a>] -         DirectRunner should not call finalize twice in UnboundedSourceExecutorFactory
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1657'>BEAM-1657</a>] -         DirectRunner should not call close twice in UnboundedSourceExecutorFactory
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1671'>BEAM-1671</a>] -         Support bypassing `validate` flag when using tfrecordio
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1673'>BEAM-1673</a>] -         PubSubIO can&#39;t write attributes
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1676'>BEAM-1676</a>] -         SdkCoreApiSurfaceTest Failed When Directory Contains Space
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1686'>BEAM-1686</a>] -         MQTT IO throws exception when client id is not specified
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1690'>BEAM-1690</a>] -         BigQueryTornadoesIT failing
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1694'>BEAM-1694</a>] -         Fix docstring inaccuracies in Python-SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1695'>BEAM-1695</a>] -         Improve Python-SDK&#39;s programming guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1709'>BEAM-1709</a>] -         Implement Single-output ParDo as Multi-output ParDo
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1711'>BEAM-1711</a>] -         Document extra features on quick start guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1713'>BEAM-1713</a>] -         SparkRuntimeContext instances are leaking via StateSpecFunctions#mapSourceFunction
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1718'>BEAM-1718</a>] -         Returning Duration.millis(Long.MAX_VALUE) in DoFn.getAllowedTimestampSkew() causes Overflow/Underflow
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1719'>BEAM-1719</a>] -         Test modules are included in generated documentation
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1721'>BEAM-1721</a>] -         Reshuffle can shift elements in time
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1723'>BEAM-1723</a>] -         FlinkRunner should deduplicate when an UnboundedSource requires Deduping
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1732'>BEAM-1732</a>] -         Window.Assign does not properly populate DisplayData of the enclosing Window transform
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1737'>BEAM-1737</a>] -         Implement a Single-output ParDo as a Multi-output ParDo with a single output
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1741'>BEAM-1741</a>] -         Update runner pages for Python
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1742'>BEAM-1742</a>] -         UnboundedSource CheckpointMark should have more precise documentation
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1751'>BEAM-1751</a>] -         Singleton ByteKeyRange with BigtableIO and Dataflow runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1762'>BEAM-1762</a>] -         Python SDK Error Message no python 3 compatible
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1767'>BEAM-1767</a>] -         Remove Aggregators from Dataflow runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1768'>BEAM-1768</a>] -         assert_that always passes for empty inputs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1769'>BEAM-1769</a>] -         Travis - python only executes py27 tox environment
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1770'>BEAM-1770</a>] -         DoFn javadoc claims no runner supports state or timers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1772'>BEAM-1772</a>] -         Support merging WindowFn other than IntervalWindow on Flink Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1776'>BEAM-1776</a>] -         Timers should be delivered in the window they were set in
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1777'>BEAM-1777</a>] -         If PipelineEnforcement throws an exception after Pipeline.run() fails, it overwrites the original failure
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1780'>BEAM-1780</a>] -         BigtableReader.splitIntoFraction should more carefully guard input
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1784'>BEAM-1784</a>] -         DataflowPipelineJob.cancel() should be idempotent
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1792'>BEAM-1792</a>] -         Spark runner uses its own filtering logic to match metrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1793'>BEAM-1793</a>] -         Frequent python post commit errors
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1795'>BEAM-1795</a>] -         Upgrade google-cloud-bigquery to 0.23.0
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1801'>BEAM-1801</a>] -         default_job_name can generate names not accepted by DataFlow
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1802'>BEAM-1802</a>] -         Spark Runner does not shutdown correctly when executing multiple pipelines in sequence
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1803'>BEAM-1803</a>] -         Metrics filters have a missmatch in class-based namespace
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1810'>BEAM-1810</a>] -         Spark runner combineGlobally uses Kryo serialization
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1815'>BEAM-1815</a>] -         Avoid shuffling twice in GABW
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1818'>BEAM-1818</a>] -         Expose side-channel inputs in PTransform
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1828'>BEAM-1828</a>] -         GlobalWatermarkHolder uses unpersist instead of destory
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1832'>BEAM-1832</a>] -         Potentially unclosed OutputStream in ApexYarnLauncher
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1835'>BEAM-1835</a>] -         NPE in DirectRunner PubsubReader.ackBatch
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1837'>BEAM-1837</a>] -         NPE in KafkaIO writer
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1838'>BEAM-1838</a>] -         GlobalWindow equals() and hashCode() doesn&#39;t work with other serialization frameworks
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1842'>BEAM-1842</a>] -         Stop matching composite PCollectionView PTransforms
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1844'>BEAM-1844</a>] -         test_memory_usage fails in post commit
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1849'>BEAM-1849</a>] -         Output from OnTimer method has windows re-assigned
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1856'>BEAM-1856</a>] -         HDFSFileSink class do not use the same configuration in master and slave
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1862'>BEAM-1862</a>] -         SplittableDoFnOperator should close the ScheduledExecutorService
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1865'>BEAM-1865</a>] -         Input Coder of GroupByKey should be a KV Coder in the Python SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1867'>BEAM-1867</a>] -         Element counts missing on Cloud Dataflow when PCollection has anything other than hardcoded name pattern
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1869'>BEAM-1869</a>] -         getProducingTransformInternal should not be available on any PValue
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1873'>BEAM-1873</a>] -         Javadoc in BigQueryIO doesn&#39;t reflect recent changes
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1886'>BEAM-1886</a>] -         Remove TextIO override in Flink runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1902'>BEAM-1902</a>] -         Datastore IO never retries on errors
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1903'>BEAM-1903</a>] -         Splittable DoFn should report watermarks via ProcessContext
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1904'>BEAM-1904</a>] -         Remove DoFn.ProcessContinuation
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1913'>BEAM-1913</a>] -         TFRecordIO should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1914'>BEAM-1914</a>] -         XML IO should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1922'>BEAM-1922</a>] -         DataSource in JdbcIO is not closed
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1926'>BEAM-1926</a>] -         Need 3 Python snippets for composite transforms section in programming guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1935'>BEAM-1935</a>] -         DirectRunner Cancel should never throw a RejectedExecutionException
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1937'>BEAM-1937</a>] -         PipelineSurgery renumbers already-unique transforms
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1947'>BEAM-1947</a>] -         DisplayData raises exception when passed unicode string
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1954'>BEAM-1954</a>] -         &quot;test&quot; extra need nose in the requirements list
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1963'>BEAM-1963</a>] -         Quick start on home page redirects to java quickstart
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1964'>BEAM-1964</a>] -         Upgrade pylint to 1.7.0
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1966'>BEAM-1966</a>] -         ApexRunner in cluster mode does not register standard FileSystems/IOChannelFactories
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1969'>BEAM-1969</a>] -         GCP extras should not required fix version of proto-google-cloud-datastore-v1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1970'>BEAM-1970</a>] -         Cannot run UserScore on Flink runner due to AvroCoder classload issues
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1972'>BEAM-1972</a>] -         HIFIO jdk module fails enforcer when only java 7 is installed on machine
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1977'>BEAM-1977</a>] -         PubsubIO fails with NPE on ACK when running locally
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1981'>BEAM-1981</a>] -         Serialization error with TimerInternals in ApexGroupByKeyOperator
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1988'>BEAM-1988</a>] -         utils.path.join does not correctly handle GCS bucket roots
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1989'>BEAM-1989</a>] -         clean SyntaxWarning
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1992'>BEAM-1992</a>] -         Count.perElement javadoc refers to Count.PerElement, but Count.PerElement is private
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1998'>BEAM-1998</a>] -         Update json_values_test.py for ValueProvider
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2017'>BEAM-2017</a>] -         DataflowRunner: fix NullPointerException that can occur when no metrics are present
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2019'>BEAM-2019</a>] -         Count.globally() requires default values for non-GlobalWindows
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2022'>BEAM-2022</a>] -         ApexTimerInternals seems to treat processing time timers as event time timers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2023'>BEAM-2023</a>] -         BigQueryIO.Write needs a way of dynamically specifying table schemas
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2029'>BEAM-2029</a>] -         NullPointerException when using multi output ParDo in Spark runner in streaming mode.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2040'>BEAM-2040</a>] -         Occasional build failures caused by AutoValue
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2052'>BEAM-2052</a>] -         Windowed file sinks should support dynamic sharding
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2071'>BEAM-2071</a>] -         AttributeError in dataflow_metrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2072'>BEAM-2072</a>] -         MicrobatchSource.reader stops reading after reaching maxNumRecords for the first time
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2073'>BEAM-2073</a>] -         Change SourceDStream.rateControlledMaxRecords() to better reflect its intention
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2074'>BEAM-2074</a>] -         SourceDStream&#39;s rate control mechanism may not work
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2077'>BEAM-2077</a>] -         Remove AvroCoder#createDatum(Reader/Writer)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2084'>BEAM-2084</a>] -         Distribution metrics should be queriable in the Dataflow Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2086'>BEAM-2086</a>] -         TestDataflowRunner relies on metrics which are not present in streaming jobs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2091'>BEAM-2091</a>] -         Typo in build instructions in Apex Runner&#39;s README.md 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2092'>BEAM-2092</a>] -         MicrobatchSource can be relieved of some of its methods since it&#39;s never used as an actual BoundedSource
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2093'>BEAM-2093</a>] -         Update Jackson version to 2.8.8 in archetype (or align with parent pom)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2094'>BEAM-2094</a>] -         WordCount examples produce garbage for non-English input text
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2095'>BEAM-2095</a>] -         The hasNext method of the iterator returned by SourceRDD#compute is not idempotent
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2096'>BEAM-2096</a>] -         NullPointerException in DataflowMetrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2098'>BEAM-2098</a>] -         Walkthrough URL in example code Javadoc is 404 not found
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2105'>BEAM-2105</a>] -         Audit that user-facing stuff is in main jars, not the test suite jars
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2106'>BEAM-2106</a>] -         NotSerializableException thrown when serializing EvaluationContext
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2113'>BEAM-2113</a>] -         Apex Runner is not able to submit any job to YARN
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2114'>BEAM-2114</a>] -         KafkaIO broken with CoderException
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2116'>BEAM-2116</a>] -         PubsubJsonClient doesn&#39;t write user created attributeMap
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2119'>BEAM-2119</a>] -         FileSystems doesn&#39;t install the local filesystem on intialization by default
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2120'>BEAM-2120</a>] -         DataflowPipelineJob processes all log messages with each waitUntilFinish
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2122'>BEAM-2122</a>] -         Writing to partitioned BigQuery tables from Dataflow is causing errors
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2130'>BEAM-2130</a>] -         Ensure options id is never null
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2136'>BEAM-2136</a>] -         AvroCoderTest.testTwoClassLoaders fails on beam_PostCommit_Java_ValidatesRunner_Dataflow
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2143'>BEAM-2143</a>] -         (Mis)Running Dataflow Wordcount gives non-helpful errors
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2152'>BEAM-2152</a>] -         Authentication fails if there is an unauthenticated gcloud tool even if application default credentials are available
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2154'>BEAM-2154</a>] -         Writing to large numbers of BigQuery tables causes out-of-memory 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2157'>BEAM-2157</a>] -         HadoopFileSystemModuleTest Failed in Some JDK Versions on Jenkins
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2162'>BEAM-2162</a>] -         Add logging during and after long running BigQuery jobs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2170'>BEAM-2170</a>] -         PubsubIO.readStrings should handle messages without metadata
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2181'>BEAM-2181</a>] -         Upgrade Bigtable dependency to 0.9.6.2
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2183'>BEAM-2183</a>] -         Maven-archetypes should depend on all Beam modules that their sources compile against
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2184'>BEAM-2184</a>] -         OutputTimeFn is not a Fn: in Python, rename to TimestampCombiner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2187'>BEAM-2187</a>] -         SparkRuntimeContextTest fails to compile
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2190'>BEAM-2190</a>] -         User depending on IO-GCP still gets a dependency on protobuf-lite
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2205'>BEAM-2205</a>] -         AttributeError when running datastore wordcount
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2210'>BEAM-2210</a>] -         PubsubIO.readPubsubMessagesWithoutAttributes is awkward
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2211'>BEAM-2211</a>] -         DataflowRunner (Java) rejects all but GCS paths for FileBasedSource/Sink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2212'>BEAM-2212</a>] -         ValueProvider-ification of core transforms makes logs and errors worse
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2213'>BEAM-2213</a>] -         Java DirectRunner takes 60s to shut down after wordcount runs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2222'>BEAM-2222</a>] -         Clean up readme files
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2223'>BEAM-2223</a>] -         java8 examples are not running
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2224'>BEAM-2224</a>] -         maptask_executor_runner_test fails in windows
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2229'>BEAM-2229</a>] -         GcsFileSystem attempts to create invalid Metadata
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2233'>BEAM-2233</a>] -         Java 8 examples should separate runners into distinct profiles, like Java 7 examples
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2236'>BEAM-2236</a>] -         Move test utilities out of python core
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2239'>BEAM-2239</a>] -         Step context not always available when exceptions raised.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2240'>BEAM-2240</a>] -         Step context not always available when exceptions raised.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2242'>BEAM-2242</a>] -         Apache Beam Java modules do not correctly shade test artifacts
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2243'>BEAM-2243</a>] -         org.apache.beam.GcpCoreApiSurfaceTest.testApiSurface fails at release-2.0.0 head
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2244'>BEAM-2244</a>] -         Move runner-facing Metrics classes to runners core
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2249'>BEAM-2249</a>] -         AvroIO does not handle partial reads
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2256'>BEAM-2256</a>] -         mongodb sdk MongoDbIO.BoundedMongoDbSource.splitKeysToFilters incorrect
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2259'>BEAM-2259</a>] -         Reshuffle may set watermark holds past the end of time
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2260'>BEAM-2260</a>] -         When using WindowedWrites and default FilenamePolicy, TextIO should throw at construction time
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2275'>BEAM-2275</a>] -         SerializableCoder fails to serialize when used with a generic type token
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2277'>BEAM-2277</a>] -         IllegalArgumentException when using Hadoop file system for WordCount example.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2279'>BEAM-2279</a>] -         Hadoop file system support should be included in examples/archetype profiles of Spark runner.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2305'>BEAM-2305</a>] -         Dinstinct transform produces unexpected output when triggered
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2326'>BEAM-2326</a>] -         Verbose INFO logging with stateful DoFns and Dataflow 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2429'>BEAM-2429</a>] -         Conflicting filesystems with used of HadoopFileSystem
</li>
</ul>
            
<h2>        New Feature
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-59'>BEAM-59</a>] -         Switch from IOChannelFactory to FileSystems
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-73'>BEAM-73</a>] -         IO design pattern: Decouple Parsers and Coders
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-135'>BEAM-135</a>] -         Utilities for &quot;batching&quot; elements in a DoFn
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-147'>BEAM-147</a>] -         Introduce an easy API for pipeline metrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-404'>BEAM-404</a>] -         PubsubIO should have a mode that supports maintaining message attributes.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-596'>BEAM-596</a>] -         Support cancel() and waitUntilFinish() in DirectRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-638'>BEAM-638</a>] -         Add sink transform to write bounded data per window, pane, [and key] even when PCollection is unbounded
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-846'>BEAM-846</a>] -         Decouple side input window mapping from WindowFn
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-885'>BEAM-885</a>] -         Move PipelineOptions from Pipeline.create() to Pipeline.run()
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1047'>BEAM-1047</a>] -         DataflowRunner: support regionalization.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1076'>BEAM-1076</a>] -         DatastoreIO template Options
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1195'>BEAM-1195</a>] -         Give triggers a cross-language serialization schema
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1198'>BEAM-1198</a>] -         ViewFn: explicitly decouple runner materialization of side inputs from SDK-specific mapping
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1327'>BEAM-1327</a>] -         Replace OutputTimeFn with enum
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1328'>BEAM-1328</a>] -         Serialize/deserialize WindowingStrategy in a language-agnostic manner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1397'>BEAM-1397</a>] -         Introduce IO metrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1398'>BEAM-1398</a>] -         KafkaIO metrics
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1441'>BEAM-1441</a>] -         Add FileSystem support to Python SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1855'>BEAM-1855</a>] -         Support Splittable DoFn in Flink Streaming runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1960'>BEAM-1960</a>] -         Hadoop InputFormat - Add Kubernetes large and small cluster Scripts for Cassandra and Elasticsearch tests
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2005'>BEAM-2005</a>] -         Add a Hadoop FileSystem implementation of Beam&#39;s FileSystem
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2054'>BEAM-2054</a>] -         Upgrade dataflow.version to v1b3-rev196-1.22.0
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2147'>BEAM-2147</a>] -         Re-enable UsesTimersInParDo tests for DataflowRunner
</li>
</ul>
    
<h2>        Improvement
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-447'>BEAM-447</a>] -         Stop referring to types with Bound/Unbound
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-649'>BEAM-649</a>] -         Smarter caching of RDDs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-720'>BEAM-720</a>] -         Run WindowedWordCount Integration Test in Flink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-806'>BEAM-806</a>] -         Maven Release Plugin Does Not Set Archetype Versions
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-818'>BEAM-818</a>] -         ValueProvider for tempLocation, runner, etc, that is unavailable to transforms during construction
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-831'>BEAM-831</a>] -         ParDo Chaining
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-848'>BEAM-848</a>] -         Shuffle input read-values to get maximum parallelism.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-911'>BEAM-911</a>] -         Mark API of multiple IOs as @Experimental
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1071'>BEAM-1071</a>] -         Support pre-existing tables with streaming BigQueryIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1074'>BEAM-1074</a>] -         Set default-partitioner in SourceRDD.Unbounded.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1148'>BEAM-1148</a>] -         Port PAssert away from Aggregators
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1179'>BEAM-1179</a>] -         Update assertions of source_test_utils from camelcase to underscore-separated
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1182'>BEAM-1182</a>] -         Direct runner should enforce encodability of unbounded source checkpoints
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1199'>BEAM-1199</a>] -         Condense recordAsOutput, finishSpecifyingOutput from POutput
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1242'>BEAM-1242</a>] -         convert older IO/Sources to use standard ReadTransform style
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1269'>BEAM-1269</a>] -         BigtableIO should make more efficient use of connections
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1272'>BEAM-1272</a>] -         Align the naming of &quot;generateInitialSplits&quot; and &quot;splitIntoBundles&quot; to better reflect their intention
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1294'>BEAM-1294</a>] -         Long running UnboundedSource Readers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1336'>BEAM-1336</a>] -         A StateSpec that doesn&#39;t care about the key shouldn&#39;t be forced to declare it as type Object
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1337'>BEAM-1337</a>] -         Use our coder infrastructure for coders for state
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1340'>BEAM-1340</a>] -         Remove or make private public bits of the SDK that shouldn&#39;t be public
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1345'>BEAM-1345</a>] -         Mark @Experimental and @Internal where needed in user-facing bits of the codebase
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1401'>BEAM-1401</a>] -         Sinks in Beam should supported windowed unbounded PCollections
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1447'>BEAM-1447</a>] -         Autodetect streaming/not streaming in DataflowRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1491'>BEAM-1491</a>] -         HadoopFileSystemOptions should be able to read the HADOOP_CONF_DIR(YARN_CONF_DIR) environment variable
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1514'>BEAM-1514</a>] -         change default timestamp in KafkaIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1520'>BEAM-1520</a>] -         Implement TFRecordIO (Reading/writing Tensorflow Standard format)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1530'>BEAM-1530</a>] -         BigQueryIO should support value-dependent windows
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1539'>BEAM-1539</a>] -         Support unknown length iterables for IterableCoder in Python SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1562'>BEAM-1562</a>] -         Use a &quot;signal&quot; to stop streaming tests as they finish.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1573'>BEAM-1573</a>] -         KafkaIO does not allow using Kafka serializers and deserializers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1633'>BEAM-1633</a>] -         Move .tox/ directory under target/ in Python SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1660'>BEAM-1660</a>] -         withCoder() error in JdbcIO JavaDoc example
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1661'>BEAM-1661</a>] -         shade guava in beam-sdks-java-io-jdbc
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1672'>BEAM-1672</a>] -         Accumulable MetricsContainers.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1689'>BEAM-1689</a>] -         Apply changes for Flink&#39;s StatefulDoFnRunner to the primary StatefulDoFnRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1693'>BEAM-1693</a>] -         Detect supported Python &amp; pip executables in Python-SDK
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1704'>BEAM-1704</a>] -         Create.TimestampedValues should take a TypeDescriptor as an alternative to explicitly specifying the Coder
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1708'>BEAM-1708</a>] -         Better error messages when GCP features are not installed 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1727'>BEAM-1727</a>] -         Add setForNowAlign(period, offset) to Timer
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1740'>BEAM-1740</a>] -         Update bigtable version to 0.9.5.1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1743'>BEAM-1743</a>] -         View.AsSingleton should be implemented in terms of a Global Combine, not the reverse
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1749'>BEAM-1749</a>] -         Upgrade pep8 to pycodestyle
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1786'>BEAM-1786</a>] -         AutoService registration of coders, like we do with PipelineRunners
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1794'>BEAM-1794</a>] -         Bigtable: improve user agent
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1799'>BEAM-1799</a>] -         IO ITs: simplify data loading design pattern
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1807'>BEAM-1807</a>] -         IO ITs: shared language neutral directory for kubernetes resources
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1812'>BEAM-1812</a>] -         Allow configuring checkpoints in Flink Runner PipelineOptions
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1827'>BEAM-1827</a>] -         Fix use of deprecated Spark APIs in the runner. 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1829'>BEAM-1829</a>] -         MQTT message compression not working on Rapsberry Pi
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1830'>BEAM-1830</a>] -         add &#39;withTopic()&#39; api to KafkaIO Reader
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1839'>BEAM-1839</a>] -         Optimize StatelessJavaSerializer
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1851'>BEAM-1851</a>] -         Sample.fixedSizedGlobally documentation should include single worker memory constraint
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1858'>BEAM-1858</a>] -         improve error message when Create.of() is called with an empty iterator
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1863'>BEAM-1863</a>] -         Allow users to override the base container image but still choose image type
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1864'>BEAM-1864</a>] -         Shorten combining state names: &quot;CombiningValue&quot; and &quot;AccumulatorCombiningState&quot; to Combining (as appropriate)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1870'>BEAM-1870</a>] -         ByteKey / ByteKeyRangeTracker should not use ByteString on public API surface
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1871'>BEAM-1871</a>] -         Thin Java SDK Core
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1875'>BEAM-1875</a>] -         Remove Spark runner custom Hadoop and Avro IOs.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1876'>BEAM-1876</a>] -         GroupIntoBatches may be able to use Combine.BinaryCombineLongFn
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1877'>BEAM-1877</a>] -         Use Iterables.isEmpty in GroupIntoBatches
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1882'>BEAM-1882</a>] -         Jdbc k8s scripts: switch pod -&gt; replicaController
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1895'>BEAM-1895</a>] -         Create tranform in python sdk should be a custom source
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1897'>BEAM-1897</a>] -         Remove Sink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1907'>BEAM-1907</a>] -         Delete PubsubBoundedReader
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1908'>BEAM-1908</a>] -         Allow setting CREATE_NEVER when using a tablespec in BigQueryIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1921'>BEAM-1921</a>] -         expose connectionProperties in JdbcIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1923'>BEAM-1923</a>] -         Improve python log messages for temporary BigQuery tables
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1949'>BEAM-1949</a>] -         Rename DoFn.Context#sideOutput to #output
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1990'>BEAM-1990</a>] -         Window.Assign should not be public since it is not meant to be used publicly
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1991'>BEAM-1991</a>] -         Update references to SumDoubleFn =&gt; Sum.ofDoubles
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1993'>BEAM-1993</a>] -         Remove special unbounded Flink source/sink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1994'>BEAM-1994</a>] -         Remove Flink examples package
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2013'>BEAM-2013</a>] -         Upgrade to Jackson 2.8.8
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2014'>BEAM-2014</a>] -         Upgrade to Google Auth 0.6.1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2020'>BEAM-2020</a>] -         Move CloudObject to Dataflow runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2021'>BEAM-2021</a>] -         Fix Java&#39;s Coder class hierarchy
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2044'>BEAM-2044</a>] -         Downgrade HBaseIO to use the stable HBase client version (1.2.x)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2047'>BEAM-2047</a>] -         PubsubStreamingWrite should use the input coder by default
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2049'>BEAM-2049</a>] -         Remove KeyedCombineFn
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2051'>BEAM-2051</a>] -         Reduce scope of the PCollectionView interface
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2060'>BEAM-2060</a>] -         XmlIO use harcoded Charset
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2062'>BEAM-2062</a>] -         EventHandler jaxb unmarshaller should be optional
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2067'>BEAM-2067</a>] -         Add support for generic CoderProvider -&gt; CoderFactory mapping with CoderRegistrar
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2068'>BEAM-2068</a>] -         Upgrade Google-Apitools to latest version
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2075'>BEAM-2075</a>] -         Update flink runner to use flink version 1.2.1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2076'>BEAM-2076</a>] -         DirectRunner: minimal transitive API surface
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2099'>BEAM-2099</a>] -         Create a WordCount example that works with HDFS
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2135'>BEAM-2135</a>] -         Rename hdfs module to hadoop-file-system, rename gcp-core to google-cloud-platform-core
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2144'>BEAM-2144</a>] -         Do not publish javadoc for Java SDK&#39;s util directory
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2165'>BEAM-2165</a>] -         Support custom user Jackson modules for PipelineOptions
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2166'>BEAM-2166</a>] -         Remove Coder.Context from the public API
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2174'>BEAM-2174</a>] -         Allow coder factories to create Coders for a wider range of types
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2206'>BEAM-2206</a>] -         Move pipeline options into separate package from beam/utils
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2218'>BEAM-2218</a>] -         PubsubIO.readPubsubMessages function names are too long
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2221'>BEAM-2221</a>] -         Make KafkaIO coder specification less awkward
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2241'>BEAM-2241</a>] -         Correctly mark top level classes and functions as private
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2245'>BEAM-2245</a>] -         Remove user-facing Timer.cancel() until further notice
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2250'>BEAM-2250</a>] -         Remove FnHarness code from PyDocs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-3770'>BEAM-3770</a>] -         The problem of kafkaIO sdk for data latency
</li>
</ul>
    
<h2>        Test
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1184'>BEAM-1184</a>] -         Add integration tests for ElasticsearchIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1622'>BEAM-1622</a>] -         Java: Rename RunnableOnService to ValidatesRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1752'>BEAM-1752</a>] -         Tag Spark runner tests that recover from checkpoint.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2057'>BEAM-2057</a>] -         Test metrics are reported to Spark Metrics sink.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2368'>BEAM-2368</a>] -         one throw &quot;Unable to find registrar for hdfs&quot; with same code
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-3383'>BEAM-3383</a>] -         Create validates runner metrics tests
</li>
</ul>
    
<h2>        Wish
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-378'>BEAM-378</a>] -         Integrate Python SDK in the Maven build
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-797'>BEAM-797</a>] -         A PipelineVisitor that creates a Spark-native pipeline. 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1648'>BEAM-1648</a>] -         Replace gsutil calls with Cloud Storage API
</li>
</ul>
    
<h2>        Task
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-825'>BEAM-825</a>] -         Fill in the documentation/runners/apex portion of the website
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1027'>BEAM-1027</a>] -         Hosting data stores to enable IO Transform testing
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1353'>BEAM-1353</a>] -         Beam should comply with PTransform style guide
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1764'>BEAM-1764</a>] -         Remove Aggregators from Flink Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1765'>BEAM-1765</a>] -         Remove Aggregators from Spark runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1766'>BEAM-1766</a>] -         Remove Aggregators from Apex runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1797'>BEAM-1797</a>] -         add CoGroupByKey to chapter &#39;Using GroupByKey&#39;  
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1887'>BEAM-1887</a>] -         Switch ParDo execution to use new DoFn in Apex runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-1915'>BEAM-1915</a>] -         Remove OldDoFn dependency in ApexGroupByKeyOperator
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2016'>BEAM-2016</a>] -         Delete HDFSFileSource/Sink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2124'>BEAM-2124</a>] -         Deprecate &lt;pipeline&gt;.options usage
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2139'>BEAM-2139</a>] -         Disable SplittableDoFn ValidatesRunner tests for Streaming Flink Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2180'>BEAM-2180</a>] -         Upgrade Apex dependency to 3.6.0
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2235'>BEAM-2235</a>] -         Restore wordcount example to its previous state
</li>
</ul>