[BEAM-14035] Convert BigQuery SchemaIO to SchemaTransform #17607

@damondouglas (Contributor) commented May 10, 2022

Please check #17181 for more context on the code changes.

This PR, currently work-in-progress, closes BEAM-14035 by creating:

  • BigQuerySchemaTransformReadConfiguration - Configuration for reading from BigQuery.
  • BigQuerySchemaTransformReadProvider - an implementation of TypedSchemaTransformProvider
  • BigQuerySchemaTransformWriteConfiguration - Configuration for writing to BigQuery.
  • BigQuerySchemaTransformWriteProvider - an implementation of TypedSchemaTransformProvider
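
As a rough illustration of how a configuration class and a typed provider fit together, here is a minimal, dependency-free sketch. It does not use Beam's classes: `ReadConfiguration`, `TypedProvider`, `Transform`, and the `tableSpec` field are hypothetical stand-ins chosen for this example, not the signatures added in this PR.

```java
import java.util.Objects;

// Hypothetical sketch only: none of these types are Beam classes.
class SchemaTransformSketch {

  /** Stand-in for a read configuration: plain data describing what to read. */
  static final class ReadConfiguration {
    final String tableSpec; // assumed field, e.g. "project:dataset.table"

    ReadConfiguration(String tableSpec) {
      this.tableSpec = Objects.requireNonNull(tableSpec, "tableSpec");
    }
  }

  /** Stand-in for the transform that a provider builds. */
  interface Transform {
    String describe();
  }

  /** Stand-in for a typed provider: ties one configuration class to one transform. */
  interface TypedProvider<ConfigT> {
    Class<ConfigT> configurationClass();

    Transform from(ConfigT configuration);
  }

  /** Provider for the read path; a write provider would mirror this shape. */
  static final class ReadProvider implements TypedProvider<ReadConfiguration> {
    @Override
    public Class<ReadConfiguration> configurationClass() {
      return ReadConfiguration.class;
    }

    @Override
    public Transform from(ReadConfiguration configuration) {
      // The transform captures the configuration it was built from.
      return () -> "read from " + configuration.tableSpec;
    }
  }

  public static void main(String[] args) {
    ReadProvider provider = new ReadProvider();
    Transform transform = provider.from(new ReadConfiguration("project:dataset.table"));
    System.out.println(transform.describe());
  }
}
```

The point of the split is that the configuration stays a plain, typed description of the options, while the provider is the only place that knows how to turn it into a transform.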

Note: I created https://issues.apache.org/jira/browse/BEAM-14168 to handle additional BigQueryIO read and write parameters in a future PR. I also created https://issues.apache.org/jira/browse/BEAM-14260 to add further integration tests in a future PR.

I would like to request the following reviewer for this PR:
@angoenka


@asf-ci commented May 10, 2022

Can one of the admins verify this patch?

@damondouglas (Contributor, Author)

Hello @MarcoRob (cc: @angoenka), would anything in this PR break org.apache.beam.sdk.io.pulsar.PulsarIOTest.testReadFromSimpleTopic?

beam_PreCommit_Java_Commit/22553 shows:

Error Message
org.apache.beam.sdk.Pipeline$PipelineExecutionException: java.lang.IllegalArgumentException: Trying to claim offset 1652219535602 before start of the range [1652219535973, 1652219536007)
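
For readers unfamiliar with this kind of message, here is a minimal sketch of the check it implies, assuming the tracker treats the restriction as a half-open range `[from, to)` and rejects any offset below `from`. This is not Beam's tracker code; `OffsetClaimSketch`, `Range`, and `tryClaim` are hypothetical names used only for illustration. The numbers are taken from the error above, where the claimed offset is 371 below the start of the range.

```java
// Hypothetical sketch only: mimics the shape of the failing check, not Beam's implementation.
class OffsetClaimSketch {

  /** Half-open range [from, to). */
  static final class Range {
    final long from;
    final long to;

    Range(long from, long to) {
      this.from = from;
      this.to = to;
    }

    @Override
    public String toString() {
      return "[" + from + ", " + to + ")";
    }
  }

  /**
   * Returns true if the offset lies inside the range and false if it is at or past the end.
   * Throws IllegalArgumentException if the offset is before the start of the range.
   */
  static boolean tryClaim(Range range, long offset) {
    if (offset < range.from) {
      throw new IllegalArgumentException(
          "Trying to claim offset " + offset + " before start of the range " + range);
    }
    return offset < range.to;
  }

  public static void main(String[] args) {
    Range range = new Range(1652219535973L, 1652219536007L);
    try {
      // 1652219535602 < 1652219535973, so this claim is rejected.
      tryClaim(range, 1652219535602L);
    } catch (IllegalArgumentException e) {
      System.out.println(e.getMessage());
    }
  }
}
```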

@MarcoRob (Contributor)

Hi @damondouglas

No, it is a flaky test that fails only rarely. I am monitoring and validating it in #17473.

Please rerun the Java PreCommit so that the checks pass.
Thanks!

@damondouglas (Contributor, Author)

Run Java PreCommit

@angoenka angoenka merged commit ee5888d into apache:master May 16, 2022
@damondouglas damondouglas deleted the BEAM-14035-BigQuerySchemaIOProvider-refactor branch May 16, 2022 18:15