Release Notes - ASF JIRA

Release Notes - Beam - Version 2.29.0 - HTML format

Configure Release Notes

Sub-task

[BEAM-7092] - Add Spark 3 test jobs to the CI (Java 8)
[BEAM-7637] - Migrate S3FileSystem to AWS SDK for Java 2
[BEAM-8696] - Beam Dependency Update Request: com.google.protobuf:protobuf-java
[BEAM-9282] - Create new module for Spark 3 runner
[BEAM-9283] - Add Spark 3 test jobs to the CI (Java 11)
[BEAM-10944] - Support CREATE FUNCTION statement with Java UDF
[BEAM-11126] - Beam Dependency Update Request: org.checkerframework:checker-qual
[BEAM-11654] - Publish Spark 2 and 3 specific Job-Server containers
[BEAM-11747] - Reject the mixed Java UDF and ZetaSQL builtin operators cases
[BEAM-11771] - Beam Dependency Update Request: org.checkerframework:checker
[BEAM-11899] - Beam Dependency Update Request: org.apache.commons:commons-pool2

Bug

[BEAM-8221] - NullPointerException in reading from non-existent Kafka topic
[BEAM-9239] - Dependency conflict with Spark using aws io
[BEAM-10582] - Beam Dependency Update Request: pyarrow
[BEAM-11033] - Update Dataflow metrics processor to handle portable jobs
[BEAM-11125] - Beam Dependency Update Request: org.checkerframework
[BEAM-11326] - Enforce deadlines during splitAtFraction in BigQueryStorageStreamSource
[BEAM-11613] - Update Dataflow multi-language pipelines to use SDK harness images available in GCR
[BEAM-11647] - beam_PreCommit_Go_Cron flaky
[BEAM-11657] - Kafka read performance regression due to added header support
[BEAM-11706] - TriggerProto translation shows up as 1% cpu on some benchmarks
[BEAM-11719] - Enforce deterministic coding for GroupByKey and Stateful DoFns
[BEAM-11720] - Beam hardcodes pip path, which may be inconvenient for some custom container users.
[BEAM-11746] - GroupIntoBatchesTest.testInGlobalWindow flaky
[BEAM-11749] - Portable Flink runner skips timers when dynamic timer tags are used
[BEAM-11784] - Java pipeline proto serialization does not ensure topological ordering of root transforms
[BEAM-11801] - BigtableIO should not set useCachedDataPool when using an emulator
[BEAM-11807] - SDK Worker with multithreading causes boto3 the KeyError(endpoint_resolver)
[BEAM-11815] - Fail to read more than 1M of items with DynamoDBIO
[BEAM-11824] - WindowingStrategyTranslation does not set merge status in proto
[BEAM-11833] - UnboundedSourceAsSDFRestrictionTracker reports incorrect watermark after failed claim
[BEAM-11834] - Array elements are assumed not to be nullable.
[BEAM-11848] - publish_docker_images script fails to deploy images for 2.28.0 RC1
[BEAM-11861] - ParquetIO throws Coder not found when using parseGenericRecord or parseFilesGenericRecord
[BEAM-11862] - Write To Kafka does not work
[BEAM-11863] - Java Quick Start is not working on MAC M1
[BEAM-11864] - NPE when registering fromRowFunction
[BEAM-11881] - DataFrame subpartitioning order is incorrect
[BEAM-11884] - Deterministic coding enforcement causes BigQueryBatchFieldLoads/GroupFilesByTableDestinations to fail
[BEAM-11887] - testMergingCustomWindowsWithoutCustomWindowTypes failing on Flink VR
[BEAM-11910] - Increase subsequent page size for bags after the first
[BEAM-11921] - Github actions Java test permared
[BEAM-11929] - DataframeTransfom, BatchRowsAsDataFrame do not preserve field order when schema created with beam.Row
[BEAM-11967] - Dataflow metrics failing in runner v2
[BEAM-11972] - ParquetIO should close all opened channels/readers
[BEAM-11979] - Can't use ReadFromMongoDB with a datetime in filter
[BEAM-12030] - DataFrame read_* functions raise IndexError when no files exist
[BEAM-12042] - TVF with no arguments causes ArrayIndexOutOfBoundsException.
[BEAM-12043] - Terminal external transforms broken on Dataflow
[BEAM-12044] - JdbcIO should explicitly setAutoCommit to false
[BEAM-12054] - Mutator.close() has to be moved to @FinishBundle in WriteFn and DeleteFn
[BEAM-12071] - DataFrame IO sinks do not correctly partition by window
[BEAM-12095] - spark_runner.py broken by Spark 3 upgrade.
[BEAM-12292] - 2.29.0 cherrypick: WindmillStateCache has a 0% hit rate in 2.29

New Feature

[BEAM-5601] - Dataflow runner should support custom windowfn for portability
[BEAM-10861] - Adds URNs and payloads to PubSub transforms
[BEAM-10994] - Add Hot Key Logging in Dataflow Runner
[BEAM-11325] - KafkaIO should be able to read from new added topic/partition automatically during pipeline execution time
[BEAM-11628] - Implement GroupBy.apply
[BEAM-11658] - Match .snappy files into the given (de)compressor
[BEAM-11694] - Re-enable combiner packing for DataflowRunner, FnApiRunner and PortableRunner
[BEAM-11698] - Implement BIT_XOR as CombineFn for Zetasql
[BEAM-11772] - GCP BigQuery sink (file loads) uses runner determined sharding for unbounded data
[BEAM-11850] - Support DDL in SQL Transform
[BEAM-11932] - ServiceOptions for configuring Dataflow

Improvement

[BEAM-2530] - Make Beam compatible with next Java LTS version (Java 11)
[BEAM-10120] - Support Dynamic Timers in the Flink Portable Runner
[BEAM-10671] - Add environment configuration fields as first-class pipeline options
[BEAM-11634] - Give JobInvoker threads unique names.
[BEAM-11705] - Write to bigquery always assigns unique insert id per row causing performance issue
[BEAM-11736] - FnApiRunner should pass PipelineOptions to sdk_worker instances
[BEAM-11752] - Using LoadingCache instead of Map to cache BundleProcessor
[BEAM-11778] - Create an extension of SimpleCatalog.
[BEAM-11789] - Upgrade gradle-dependency-analyze plugin to 1.4.3.
[BEAM-11806] - KafkaIO - Partition Recognition in WriteRecords
[BEAM-11866] - Remove InvalidWindows from Java SDK and use already merged bit
[BEAM-11867] - Remove SYNCHRONIZED_PROCESSING_TIME time domain from model protos
[BEAM-11870] - IllegalArgumentExceptions from Runner.fromOptions in Pipeline.create should be raised as-is
[BEAM-11913] - Add support for Hadoop configuration on ParquetIO
[BEAM-11941] - Upgrade Flink runner to Flink version 1.12.2
[BEAM-11946] - Use ReadFromKafkaDoFn for KafkaIO.Read by default when beam_fn_api is enabled
[BEAM-11958] - Don't use new Jackson APIs to avoid classpath issues when parsing AWS configuration
[BEAM-11969] - Make row-group size configurable in ParquetIO.Sink
[BEAM-12010] - CalcMergeRule should not merge BeamCalcRel and BeamZetaSqlCalcRel.
[BEAM-12033] - Validate casts from double literals to numeric during expression conversion.
[BEAM-12057] - Add missing populateDisplayData methods to ParquetIO

Test

[BEAM-11023] - GroupByKeyTest testLargeKeys100MB and testGroupByKeyWithBadEqualsHashCode are failing on Spark Structured Streaming runner

Wish

[BEAM-11213] - Beam metrics should be displayed in Spark UI

Task

[BEAM-11265] - Java quickstart shouldn't use pom.xml as input
[BEAM-11324] - Additional verification in PartitioningSession

Edit/Copy Release Notes

The text area below allows the project release notes to be edited and copied to another document.

Release Notes - Beam - Version 2.29.0
    
<h2>        Sub-task
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-7092'>BEAM-7092</a>] -         Add Spark 3 test jobs to the CI (Java 8)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-7637'>BEAM-7637</a>] -         Migrate S3FileSystem to AWS SDK for Java 2
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-8696'>BEAM-8696</a>] -         Beam Dependency Update Request: com.google.protobuf:protobuf-java
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-9282'>BEAM-9282</a>] -         Create new module for Spark 3 runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-9283'>BEAM-9283</a>] -         Add Spark 3 test jobs to the CI (Java 11)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10944'>BEAM-10944</a>] -         Support CREATE FUNCTION statement with Java UDF
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11126'>BEAM-11126</a>] -         Beam Dependency Update Request: org.checkerframework:checker-qual
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11654'>BEAM-11654</a>] -         Publish Spark 2 and 3 specific Job-Server containers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11747'>BEAM-11747</a>] -         Reject the mixed Java UDF and ZetaSQL builtin operators cases 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11771'>BEAM-11771</a>] -         Beam Dependency Update Request: org.checkerframework:checker
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11899'>BEAM-11899</a>] -         Beam Dependency Update Request: org.apache.commons:commons-pool2
</li>
</ul>
            
<h2>        Bug
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-8221'>BEAM-8221</a>] -         NullPointerException in reading from non-existent Kafka topic
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-9239'>BEAM-9239</a>] -         Dependency conflict with Spark using aws io
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10582'>BEAM-10582</a>] -         Beam Dependency Update Request: pyarrow
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11033'>BEAM-11033</a>] -         Update Dataflow metrics processor to handle portable jobs
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11125'>BEAM-11125</a>] -         Beam Dependency Update Request: org.checkerframework
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11326'>BEAM-11326</a>] -         Enforce deadlines during splitAtFraction in BigQueryStorageStreamSource
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11613'>BEAM-11613</a>] -         Update Dataflow multi-language pipelines to use SDK harness images available in GCR
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11647'>BEAM-11647</a>] -         beam_PreCommit_Go_Cron flaky
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11657'>BEAM-11657</a>] -         Kafka read performance regression due to added header support
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11706'>BEAM-11706</a>] -         TriggerProto translation shows up as 1% cpu on some benchmarks
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11719'>BEAM-11719</a>] -         Enforce deterministic coding for GroupByKey and Stateful DoFns
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11720'>BEAM-11720</a>] -         Beam hardcodes pip path, which may be inconvenient for some custom container users.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11746'>BEAM-11746</a>] -         GroupIntoBatchesTest.testInGlobalWindow flaky
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11749'>BEAM-11749</a>] -         Portable Flink runner skips timers when dynamic timer tags are used
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11784'>BEAM-11784</a>] -         Java pipeline proto serialization does not ensure topological ordering of root transforms
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11801'>BEAM-11801</a>] -         BigtableIO should not set useCachedDataPool when using an emulator
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11807'>BEAM-11807</a>] -         SDK Worker with multithreading causes boto3 the KeyError(endpoint_resolver)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11815'>BEAM-11815</a>] -         Fail to read more than 1M of items with DynamoDBIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11824'>BEAM-11824</a>] -         WindowingStrategyTranslation does not set merge status in proto
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11833'>BEAM-11833</a>] -         UnboundedSourceAsSDFRestrictionTracker reports incorrect watermark after failed claim
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11834'>BEAM-11834</a>] -         Array elements are assumed not to be nullable.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11848'>BEAM-11848</a>] -         publish_docker_images script fails to deploy images for 2.28.0 RC1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11861'>BEAM-11861</a>] -         ParquetIO throws Coder not found when using parseGenericRecord or parseFilesGenericRecord
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11862'>BEAM-11862</a>] -         Write To Kafka does not work
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11863'>BEAM-11863</a>] -         Java Quick Start is not working on MAC M1
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11864'>BEAM-11864</a>] -         NPE when registering fromRowFunction
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11881'>BEAM-11881</a>] -         DataFrame subpartitioning order is incorrect
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11884'>BEAM-11884</a>] -         Deterministic coding enforcement causes BigQueryBatchFieldLoads/GroupFilesByTableDestinations to fail
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11887'>BEAM-11887</a>] -         testMergingCustomWindowsWithoutCustomWindowTypes failing on Flink VR
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11910'>BEAM-11910</a>] -         Increase subsequent page size for bags after the first
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11921'>BEAM-11921</a>] -         Github actions Java test permared
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11929'>BEAM-11929</a>] -         DataframeTransfom, BatchRowsAsDataFrame do not preserve field order when schema created with beam.Row
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11967'>BEAM-11967</a>] -         Dataflow metrics failing in runner v2
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11972'>BEAM-11972</a>] -         ParquetIO should close all opened channels/readers
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11979'>BEAM-11979</a>] -         Can&#39;t use ReadFromMongoDB with a datetime in filter
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12030'>BEAM-12030</a>] -         DataFrame read_* functions raise IndexError when no files exist
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12042'>BEAM-12042</a>] -         TVF with no arguments causes ArrayIndexOutOfBoundsException.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12043'>BEAM-12043</a>] -         Terminal external transforms broken on Dataflow
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12044'>BEAM-12044</a>] -         JdbcIO should explicitly setAutoCommit to false
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12054'>BEAM-12054</a>] -         Mutator.close() has to be moved to @FinishBundle in WriteFn and DeleteFn 
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12071'>BEAM-12071</a>] -         DataFrame IO sinks do not correctly partition by window
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12095'>BEAM-12095</a>] -         spark_runner.py broken by Spark 3 upgrade.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12292'>BEAM-12292</a>] -         2.29.0 cherrypick: WindmillStateCache has a 0% hit rate in 2.29
</li>
</ul>
            
<h2>        New Feature
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-5601'>BEAM-5601</a>] -         Dataflow runner should support custom windowfn for portability
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10861'>BEAM-10861</a>] -         Adds URNs and payloads to PubSub transforms
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10994'>BEAM-10994</a>] -         Add Hot Key Logging in Dataflow Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11325'>BEAM-11325</a>] -         KafkaIO should be able to read from new added topic/partition automatically during pipeline execution time
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11628'>BEAM-11628</a>] -         Implement GroupBy.apply
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11658'>BEAM-11658</a>] -         Match .snappy files into the given (de)compressor
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11694'>BEAM-11694</a>] -         Re-enable combiner packing for DataflowRunner, FnApiRunner and PortableRunner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11698'>BEAM-11698</a>] -         Implement BIT_XOR as CombineFn for Zetasql
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11772'>BEAM-11772</a>] -         GCP BigQuery sink (file loads) uses runner determined sharding for unbounded data
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11850'>BEAM-11850</a>] -         Support DDL in SQL Transform
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11932'>BEAM-11932</a>] -         ServiceOptions for configuring Dataflow
</li>
</ul>
    
<h2>        Improvement
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-2530'>BEAM-2530</a>] -         Make Beam compatible with next Java LTS version (Java 11)
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10120'>BEAM-10120</a>] -         Support Dynamic Timers in the Flink Portable Runner
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-10671'>BEAM-10671</a>] -         Add environment configuration fields as first-class pipeline options
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11634'>BEAM-11634</a>] -         Give JobInvoker threads unique names.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11705'>BEAM-11705</a>] -         Write to bigquery always assigns unique insert id per row causing performance issue
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11736'>BEAM-11736</a>] -         FnApiRunner should pass PipelineOptions to sdk_worker instances
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11752'>BEAM-11752</a>] -         Using LoadingCache instead of Map to cache BundleProcessor
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11778'>BEAM-11778</a>] -         Create an extension of SimpleCatalog.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11789'>BEAM-11789</a>] -         Upgrade gradle-dependency-analyze plugin to 1.4.3.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11806'>BEAM-11806</a>] -         KafkaIO - Partition Recognition in WriteRecords
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11866'>BEAM-11866</a>] -         Remove InvalidWindows from Java SDK and use already merged bit
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11867'>BEAM-11867</a>] -         Remove SYNCHRONIZED_PROCESSING_TIME time domain from model protos
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11870'>BEAM-11870</a>] -         IllegalArgumentExceptions from Runner.fromOptions in Pipeline.create should be raised as-is
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11913'>BEAM-11913</a>] -         Add support for Hadoop configuration on ParquetIO
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11941'>BEAM-11941</a>] -         Upgrade Flink runner to Flink version 1.12.2
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11946'>BEAM-11946</a>] -         Use ReadFromKafkaDoFn for KafkaIO.Read by default when beam_fn_api is enabled
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11958'>BEAM-11958</a>] -         Don&#39;t use new Jackson APIs to avoid classpath issues when parsing AWS configuration
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11969'>BEAM-11969</a>] -         Make row-group size configurable in ParquetIO.Sink
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12010'>BEAM-12010</a>] -         CalcMergeRule should not merge BeamCalcRel and BeamZetaSqlCalcRel.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12033'>BEAM-12033</a>] -         Validate casts from double literals to numeric during expression conversion.
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-12057'>BEAM-12057</a>] -         Add missing populateDisplayData methods to ParquetIO
</li>
</ul>
    
<h2>        Test
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11023'>BEAM-11023</a>] -         GroupByKeyTest testLargeKeys100MB and testGroupByKeyWithBadEqualsHashCode are failing on Spark Structured Streaming runner
</li>
</ul>
    
<h2>        Wish
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11213'>BEAM-11213</a>] -         Beam metrics should be displayed in Spark UI
</li>
</ul>
    
<h2>        Task
</h2>
<ul>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11265'>BEAM-11265</a>] -         Java quickstart shouldn&#39;t use pom.xml as input
</li>
<li>[<a href='https://issues.apache.org/jira/browse/BEAM-11324'>BEAM-11324</a>] -         Additional verification in PartitioningSession
</li>
</ul>