Updated "Fhir Data Pipes" concepts documentation #1242

Open
wants to merge 1 commit into
base: master
from

Conversation

fredhersch
Collaborator

@fredhersch fredhersch commented Nov 13, 2024

Description of what I changed

Updated the Concepts documentation to bring it up to date with the latest features.

E2E test

TESTED: on local

Checklist: I completed these to help reviewers :)

  • I have read and will follow the review process.

  • I am familiar with Google Style Guides for the language I have coded in.

    No? Please take some time and review Java and Python style guides.

  • My IDE is configured to follow the Google code styles.

    No? Unsure? -> configure your IDE.

  • I have added tests to cover my changes. (If you refactored existing code that was well tested you do not have to add tests)

  • I ran mvn clean package right before creating this pull request and added all formatting changes to my commit.

  • All new and existing tests passed.

  • My pull request is based on the latest changes of the master branch.

    No? Unsure? -> execute command git pull --rebase upstream master

@codecov-commenter

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 51.48%. Comparing base (f0d0a6c) to head (ac086e1).

Additional details and impacted files
@@            Coverage Diff            @@
##             master    #1242   +/-   ##
=========================================
  Coverage     51.48%   51.48%           
  Complexity      716      716           
=========================================
  Files            95       95           
  Lines          5907     5907           
  Branches        787      787           
=========================================
  Hits           3041     3041           
  Misses         2578     2578           
  Partials        288      288           


Collaborator

@bashir2 bashir2 left a comment


Sorry for the delay @fredhersch.

- continuously transform FHIR data into an analytics friendly Parquet on FHIR
- format.
+ continuously transform FHIR data into an analytics friendly "Parquet-on-FHIR"
+ schema.
Collaborator

nit: consider adding a link to schema.md.

materialized) which can then be queried using simpler SQL statements.

## Parquet-on-FHIR schema
The output of the _FHIR Data Pipes ETL Pipelines_ is conversion of raw FHIR
Collaborator

Suggested change
- The output of the _FHIR Data Pipes ETL Pipelines_ is conversion of raw FHIR
+ The _FHIR Data Pipes ETL Pipelines_ convert raw FHIR

## Parquet-on-FHIR schema
The output of the _FHIR Data Pipes ETL Pipelines_ is conversion of raw FHIR
resources to a _Parquet-on-FHIR_ schema representation. This takes place for
each resource type and follows the [FHIR Data Pipes Schema Mapping Rules](https://github.com/google/fhir-data-pipes/blob/master/doc/schema.md){:.external}.
Collaborator

Suggested change
- each resource type and follows the [FHIR Data Pipes Schema Mapping Rules](https://github.com/google/fhir-data-pipes/blob/master/doc/schema.md){:.external}.
+ each resource type and follows the [Schema Mapping Rules](https://github.com/google/fhir-data-pipes/blob/master/doc/schema.md).

resources to a _Parquet-on-FHIR_ schema representation. This takes place for
each resource type and follows the [FHIR Data Pipes Schema Mapping Rules](https://github.com/google/fhir-data-pipes/blob/master/doc/schema.md){:.external}.

The generated columnar Parquet files provide the 'base data warehouse' that can
Collaborator

Suggested change
- The generated columnar Parquet files provide the 'base data warehouse' that can
+ The generated columnar Parquet files provide the *"base data warehouse"* that can

@@ -179,12 +190,14 @@ Using the ViewDefinition editor you can:

## Output Data Formats

### Conversion to PostgreSQL
Applying the FHIR ViewDefinition resources to the "base dwh" will generate materialized views which represent a "constrained" set of data to be used for downstream analytics applications (such as dashboards or reporting).
Collaborator

nit1: please fix the style, e.g., break the lines at 80 chars.

nit2: consider adding a reference to the config param for enabling this, i.e., viewDefinitionsDir
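
To make the quoted paragraph on ViewDefinitions more concrete, here is a minimal Python sketch of the general idea: a view selects a constrained set of fields from nested FHIR resources and emits flat rows. Everything in it is hypothetical (the `apply_view` and `get_path` helpers, the column-to-path mapping, and the sample Patient records). It is not the FHIR Data Pipes implementation, and real ViewDefinitions use FHIRPath expressions rather than the simplified dotted paths assumed here.

```python
from typing import Any, Dict, List, Optional


def get_path(resource: Dict[str, Any], path: str) -> Optional[Any]:
    """Follow a dotted path through nested dicts and lists.

    Simplifying assumption: for repeated fields, only the first element
    is used. Returns None when any step of the path is missing.
    """
    current: Any = resource
    for key in path.split("."):
        if isinstance(current, list):
            current = current[0] if current else None
        if not isinstance(current, dict):
            return None
        current = current.get(key)
    if isinstance(current, list):
        current = current[0] if current else None
    return current


def apply_view(resources: List[Dict[str, Any]],
               columns: Dict[str, str]) -> List[Dict[str, Any]]:
    """Flatten each resource into one row with the requested columns."""
    return [
        {name: get_path(resource, path) for name, path in columns.items()}
        for resource in resources
    ]


# Hypothetical view: output column name -> dotted path in the resource.
PATIENT_VIEW = {
    "id": "id",
    "family_name": "name.family",
    "gender": "gender",
    "city": "address.city",
}

# Hypothetical sample data standing in for the "base dwh".
PATIENTS = [
    {"resourceType": "Patient", "id": "p1", "gender": "female",
     "name": [{"family": "Doe", "given": ["Jane"]}]},
    {"resourceType": "Patient", "id": "p2", "gender": "male",
     "name": [{"family": "Otieno"}],
     "address": [{"city": "Nairobi"}]},
]

if __name__ == "__main__":
    for row in apply_view(PATIENTS, PATIENT_VIEW):
        print(row)
```

The flat rows are what makes the downstream SQL simpler: each column is a scalar, so no unnesting of repeated fields is needed at query time.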


### Conversion to Parquet
The resulting database tables can be loaded into a commonly used relational database management system such as [PostgreSQL](https://www.postgresql.org/) or [MySQL](https://www.mysql.com/).
Collaborator

nit: Consider adding a reference to the config param that enables this, i.e., sinkDbConfigPath.
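
As a rough illustration of the quoted sentence about loading the resulting tables into a relational database, the sketch below writes flat rows into a table and runs a plain SQL query over them. It uses Python's built-in `sqlite3` module purely as a stand-in for PostgreSQL or MySQL; the `load_rows` helper, the `patient_flat` table name, and the sample rows are hypothetical and do not reflect how the pipeline's database sink is actually configured.

```python
import sqlite3
from typing import Any, Dict, List


def load_rows(conn: sqlite3.Connection, table: str,
              rows: List[Dict[str, Any]]) -> None:
    """Create a table from the keys of the first row and insert all rows.

    Assumption: every row has the same keys and all values fit in TEXT.
    """
    if not rows:
        return
    columns = list(rows[0].keys())
    col_defs = ", ".join(f'"{c}" TEXT' for c in columns)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({col_defs})')
    placeholders = ", ".join("?" for _ in columns)
    conn.executemany(
        f'INSERT INTO "{table}" VALUES ({placeholders})',
        [tuple(row[c] for c in columns) for row in rows],
    )
    conn.commit()


# Hypothetical flat rows, e.g. the output of applying a view.
flat_rows = [
    {"id": "p1", "gender": "female", "birth_date": "1990-04-01"},
    {"id": "p2", "gender": "male", "birth_date": "1985-11-23"},
]

# In-memory SQLite database as a stand-in for a real RDBMS.
conn = sqlite3.connect(":memory:")
load_rows(conn, "patient_flat", flat_rows)

# A simple aggregate query of the kind a dashboard might issue.
counts = conn.execute(
    "SELECT gender, COUNT(*) FROM patient_flat GROUP BY gender ORDER BY gender"
).fetchall()
print(counts)
```

Swapping SQLite for PostgreSQL would mainly change the connection setup and the placeholder syntax; the table-plus-simple-query pattern stays the same.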

3 participants