Skip to content

If a Dataset symlink is created afterwards with a DatasetEvent, the link is not created in the lineage #2738

@dkt-sophie-ly

Description

@dkt-sophie-ly

If I create 2 run events that create 2 separate lineage like the following:

ns1:input1 ----- job1 -----> ns2:output1

and

ns1:input2 ------ job2 -----> ns2:output

Then I sent a DatasetEvent to create a symlink and specify that input1 and input2 are in fact the same dataset.

{
  "eventTime": "2023-07-18T17:20:00",
  "dataset": {
    "namespace": "ns1",
    "name": "input1",
    "facets": {
      "symlinks": {
        "identifiers": [
          {
            "namespace": "ns1",
            "name": "input2",
            "type": "DB_TABLE"
          }
        ]
      }
    }
  }
}

So I expected that the 2 lineage merge into one like the following:

ns1:input1 ----- job1 -----> ns2:output1
|
|--------------- job2 ------> ns2:output2

But currently both lineage are not merge and stay separated.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    Status
    No status

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions