Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

All are welcome.

Table of Contents

September 8, 2022 (10am PT)

August 11, 2022 (10am PT)

...

Widget Connector
urlhttp://youtube.com/watch?v=H3Q7Z2eDfy0

Notes:

  • Docs Site Update
    • Lots of activity:
      • 19 closed PRs
    • Infra is becoming robust but not ready to launch
    • URL: openlineage.io/docs
    • Need:
      • additions to About, Getting Started
      • additions to Object Model section
      • Complete the Integration landing page
    • Stretch goal for next month: put it in production
  • Recent releases
  • Extractors: example and tutorial
    • Airflow: defined tasks composed of pieces of code executed by operators (which number in the hundreds)
    • Extraction of data
      • Operator example
        • accesses operator object
        • processes it in customizable way
        • runtime information can also be extracted
          • additional method (`extract_on_complete`) 
      • Metadata matches the structure of the OpenLineage spec
        • supplemented by facets (`job_facets`)
      • How to expose:
        • set up env vars supplying full paths to extractor classes (separated by commas)
      • Help available from OpenLineage side:
        • SQL parser 
        • common library covering a few systems
        • community help on Slack and Github (please contribute your custom extractors!)
    • Typical problems
      • incorrect path provided
        • more debugging info would help in this case – help welcome!
      • Imports from Airflow
        • Python prevents import cycles, leading to extractor failure
        • use local imports instead, with type checking
    • What's the future?
      • debugability
      • additional coverage – PythonOperator, TaskFlow
        • watching AIP-44 in Airflow to make it more data-aware
      • covering hooks
        • e.g., with PythonOperator
    • See also: new doc about this on the forthcoming docs site
    • Q & A
      • Does the documentation link out to the extractors currently in the Airflow library? Helpful for examples
        • we need to add links to the doc
  • Open Discussion
    • Mandy: presenting at Open Source Summit, Dublin, 9/15
    • Ross: talking at ApacheCon in New Orleans
    • Ross: should we create a calendar of events?
    • Maciej: we're looking for feedback on the Flink integration
      • let us know if it solves your problems, etc.
    • Mandy: Egeria running a hackathon as part of the Grace Hopper Open Source Day event on 9/16; theme: sustainability

July 14, 2022 (10am PT)

Attendees:

...