Release Notes - Beam - Version 2.19.0

Sub-task

  • [BEAM-4455] - Provide automatic schema registration for Protos
  • [BEAM-5546] - Beam Dependency Update Request: commons-codec:commons-codec
  • [BEAM-7116] - Remove KV from Schema transforms
  • [BEAM-7850] - Make Environment a top level attribute of PTransform
  • [BEAM-7861] - Make it easy to change between multi-process and multi-thread mode for Python Direct runners (see the sketch after this list)
  • [BEAM-7949] - Add time-based cache threshold support in the data service of the Python SDK harness
  • [BEAM-7951] - Allow runners to configure a custom WindowedValue coder such as ValueOnlyWindowedValueCoder
  • [BEAM-8623] - Add an additional message field to the Provision API response for passing the status endpoint
  • [BEAM-8624] - Implement FnService for the status API in the Dataflow runner
  • [BEAM-8701] - Beam Dependency Update Request: commons-io:commons-io
  • [BEAM-8716] - Beam Dependency Update Request: org.apache.commons:commons-csv
  • [BEAM-8717] - Beam Dependency Update Request: org.apache.commons:commons-lang3
  • [BEAM-8749] - Beam Dependency Update Request: com.datastax.cassandra:cassandra-driver-mapping
  • [BEAM-8842] - Consistently timing out: BigQueryStreamingInsertTransformIntegrationTests.test_multiple_destinations_transform
  • [BEAM-8946] - Report collection size from MongoDBIOIT
  • [BEAM-8978] - Report saved data size from HadoopFormatIOIT
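
For BEAM-7861 above, a minimal sketch of switching the Python DirectRunner between execution modes. The --direct_running_mode and --direct_num_workers options and their accepted values are assumptions based on this work, not something these notes confirm.

    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions

    # Assumed values for --direct_running_mode: 'in_memory' (single process),
    # 'multi_threading', or 'multi_processing'; the pipeline itself is unchanged.
    options = PipelineOptions([
        '--direct_running_mode=multi_processing',
        '--direct_num_workers=4',
    ])

    with beam.Pipeline(options=options) as p:
        (p
         | 'Create' >> beam.Create(range(10))
         | 'Square' >> beam.Map(lambda x: x * x)
         | 'Print' >> beam.Map(print))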

Bug

  • [BEAM-2409] - Spark runner produces exactly twice the number of results in streaming mode when triggers are used to re-window results onto the global window
  • [BEAM-5495] - PipelineResources algorithm is not working in most environments
  • [BEAM-7156] - Nexmark Query 14 'SESSION_SIDE_INPUT_JOIN' is producing twice the number of results in Spark Runner
  • [BEAM-7868] - Hidden Flink Runner parameters are dropped in python pipelines
  • [BEAM-7991] - Race condition in the Gradle cleanPython task
  • [BEAM-8435] - Allow access to PaneInfo from Python DoFns (see the sketch after this list)
  • [BEAM-8496] - Remove SDF translators in the Flink streaming transform translator
  • [BEAM-8577] - FileSystems may not have been initialized during ResourceId deserialization
  • [BEAM-8581] - Python SDK labels on-time empty panes as late
  • [BEAM-8582] - Python SDK emits duplicate records for Default and AfterWatermark triggers
  • [BEAM-8810] - Dataflow runner - Work stuck in state COMMITTING with streaming commit rpcs
  • [BEAM-8830] - Fix Flatten tests in the Spark Structured Streaming runner
  • [BEAM-8846] - Force synchronization of the stream observer in BeamFnControlClient
  • [BEAM-8865] - FileIO's Javadoc is outdated: TypeDescriptors.KVs and unhandled IOException
  • [BEAM-8885] - PubsubGrpcClient doesn't respect PubsubOptions#getPubsubRootUrl
  • [BEAM-8943] - SDK harness servers don't shut down properly when SDK harness environment cleanup fails
  • [BEAM-8955] - AvroSchemaTest.testAvroPipelineGroupBy broken on Spark runner
  • [BEAM-8959] - Boolean pipeline options which default to true cannot be set to false
  • [BEAM-8962] - FlinkMetricContainer causes churn in the JobManager and makes the web frontend malfunction
  • [BEAM-8988] - apache_beam.io.gcp.bigquery_read_it_test failing with: NotImplementedError: BigQuery source must be split before being read
  • [BEAM-8989] - Backwards incompatible change in ParDo.getSideInputs (caught by failure when running Apache Nemo quickstart)
  • [BEAM-8995] - apache_beam.io.gcp.bigquery_read_it_test failing on Py3.5 PC with: TypeError: the JSON object must be str, not 'bytes'
  • [BEAM-8999] - PGBKCVOperation does not respect timestamp combiners
  • [BEAM-9006] - Metaspace memory leak caused by the shutdown hook of ProcessManager
  • [BEAM-9034] - Update environment_id for ExternalTransform in Python SDK
  • [BEAM-9050] - Beam pickler doesn't pickle classes that have __module__ set to None
  • [BEAM-9060] - Flink suppresses stdout/stderr during JobGraph generation from JAR
  • [BEAM-9065] - Spark runner incorrectly accumulates metrics between runs
  • [BEAM-9078] - Large tarball artifacts should use GCS resumable upload
  • [BEAM-9083] - PR9677 breaks ValidatesRunnerTest of open source runners
  • [BEAM-9123] - HadoopResourceId returns wrong directory name
  • [BEAM-9127] - postcommit: suites:portable:py2:crossLanguagePortableWordCount failing
  • [BEAM-9138] - beam_Release_Gradle_Build failure in Go
  • [BEAM-9144] - Beam's own Avro TimeConversion class in beam-sdk-java-core
  • [BEAM-9151] - Dataflow legacy worker tests are misconfigured
  • [BEAM-9423] - Re-Add the stop button to the Flink web interface for pipelines
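
For BEAM-8435 above, a minimal sketch of reading pane information inside a Python DoFn, assuming the beam.DoFn.PaneInfoParam default-value parameter exposed by this work; treat the exact parameter and attribute names as assumptions rather than a guarantee of the 2.19.0 API.

    import apache_beam as beam

    class TagWithPaneTiming(beam.DoFn):
        def process(self, element, pane_info=beam.DoFn.PaneInfoParam):
            # pane_info describes the trigger firing that produced this element,
            # e.g. is_first, is_last, and timing (EARLY / ON_TIME / LATE).
            yield element, pane_info.timing

    with beam.Pipeline() as p:
        (p
         | beam.Create([('k', 1), ('k', 2)])
         | beam.ParDo(TagWithPaneTiming())
         | beam.Map(print))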

New Feature

  • [BEAM-1440] - Create a BigQuery source (that implements iobase.BoundedSource) for Python SDK (see the sketch after this list)
  • [BEAM-6671] - Beam 2.9.0 java.lang.NoSuchFieldError: internal_static_google_rpc_LocalizedMessage_fieldAccessorTable
  • [BEAM-8139] - Execute portable Spark application jar
  • [BEAM-8630] - Prototype of BeamSQL Calc using ZetaSQL Expression Evaluator
  • [BEAM-8844] - [SQL] Create performance tests for BigQueryTable
  • [BEAM-9023] - Upgrade to ZetaSQL 2019.12.1
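
For BEAM-1440 above, a minimal sketch of a BigQuery read in the Python SDK using the long-standing BigQuerySource entry point; how the new iobase.BoundedSource-based implementation is surfaced in 2.19.0 is not stated in these notes, so the snippet only illustrates the kind of read it targets. The project, bucket, and query are hypothetical.

    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions

    options = PipelineOptions([
        '--project=my-project',                 # hypothetical project
        '--temp_location=gs://my-bucket/tmp',   # staging area typically needed for BigQuery reads
    ])

    with beam.Pipeline(options=options) as p:
        (p
         | 'ReadBQ' >> beam.io.Read(beam.io.BigQuerySource(
               query='SELECT word, word_count FROM `bigquery-public-data.samples.shakespeare`',
               use_standard_sql=True))
         | 'Print' >> beam.Map(print))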

Improvement

  • [BEAM-3419] - Enable iterable side input for Beam runners (see the sketch after this list)
  • [BEAM-3759] - Add support for PaneInfo descriptor in Python SDK
  • [BEAM-5192] - Support Elasticsearch 7.x
  • [BEAM-6008] - Improve error reporting in Java/Python PortableRunner
  • [BEAM-7961] - Add tests for all runner native transforms and some widely used composite transforms to cross-language validates runner test suite
  • [BEAM-8296] - Containerize the Spark job server
  • [BEAM-8536] - Migrate usage of DelayedBundleApplication.requested_execution_time to time duration
  • [BEAM-8745] - More fine-grained controls for the size of a BigQuery Load job
  • [BEAM-8746] - Allow the local job service to work from inside Docker
  • [BEAM-8801] - PubsubMessageToRow should not check useFlatSchema() in processElement
  • [BEAM-8816] - Load-balance bundle processing with multiple SDK workers
  • [BEAM-8837] - PCollectionVisualizationTest: possible bug
  • [BEAM-8886] - Add a Python mongodbio integration test that triggers a load split
  • [BEAM-8891] - Create and submit Spark portable jar in Python
  • [BEAM-8901] - add experimental flag for reusing flink local environment
  • [BEAM-8929] - Remove unnecessary exception handling in FnApiControlClientPoolService
  • [BEAM-8930] - External workers should receive the artifact endpoint when started from Python
  • [BEAM-8935] - Fail fast if SDK harness startup fails
  • [BEAM-8953] - Extend ParquetIO.Read/ReadFiles.Builder to support Avro GenericData model
  • [BEAM-8993] - [SQL] MongoDB should use predicate push-down
  • [BEAM-8996] - Auto-generate pipeline options documentation for FlinkRunner
  • [BEAM-9000] - Java Test Assertions without toString for GenericJson subclasses
  • [BEAM-9004] - Update Mockito Matchers usage to ArgumentMatchers since Matchers is deprecated in Mockito 2
  • [BEAM-9012] - Include `-> None` on Pipeline and PipelineOptions `__init__` methods for pytype compatibility
  • [BEAM-9019] - Improve Spark Encoders (wrappers of beam coders)
  • [BEAM-9020] - LengthPrefixUnknownCodersTest to avoid relying on AbstractMap's equality
  • [BEAM-9053] - Improve error message when unable to get the correct filesystem for specified path in Python SDK
  • [BEAM-9055] - Unify the config names of Fn Data API across languages
  • [BEAM-9122] - Add uses_keyed_state step property to the Python Dataflow runner
  • [BEAM-9163] - pydoc: Update sphinx_rtd_theme
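
For BEAM-3419 above, a minimal sketch of requesting a side input as an iterable with beam.pvalue.AsIter in the Python SDK; whether a particular runner serves it through the new iterable access pattern is a runner-side detail these notes do not spell out.

    import apache_beam as beam

    with beam.Pipeline() as p:
        stopwords = p | 'Stopwords' >> beam.Create(['the', 'a', 'of'])
        words = p | 'Words' >> beam.Create(['the', 'beam', 'model', 'a'])

        # AsIter makes the side input available as a lazily iterable view instead
        # of materializing it as an in-memory list up front.
        filtered = words | 'DropStopwords' >> beam.Filter(
            lambda word, stop: word not in stop,
            stop=beam.pvalue.AsIter(stopwords))

        filtered | beam.Map(print)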

Test

  • [BEAM-8512] - Add integration tests for Python "flink_runner.py"

Task

  • [BEAM-2572] - Implement an S3 filesystem for Python SDK (see the sketch after this list)
  • [BEAM-5690] - Issue with GroupByKey in BeamSql using SparkRunner
  • [BEAM-8342] - Upgrade the Samza runner to use Samza 1.3
  • [BEAM-9358] - BigQueryIO potential write speed regression
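
For BEAM-2572 above, a minimal sketch of reading and writing s3:// paths from the Python SDK. It assumes the S3 filesystem is installed (for example via an aws extra of the apache_beam package) and that AWS credentials come from the environment; the bucket and paths are hypothetical.

    import apache_beam as beam

    with beam.Pipeline() as p:
        (p
         | 'Read' >> beam.io.ReadFromText('s3://my-bucket/input/*.txt')
         | 'Count' >> beam.combiners.Count.Globally()
         | 'Write' >> beam.io.WriteToText('s3://my-bucket/output/line-count'))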
