You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
*Enable requesting `column-lineage` endpoint by a dataset version, job version or dataset field of a specific dataset version.*
9
+
10
+
### Fixed
11
+
12
+
* Allow null column type in column-lineage [`#2272`](https://github.com/MarquezProject/marquez/pull/2272)[@pawel-big-lebowski](https://github.com/pawel-big-lebowski)
13
+
* Include error message for JSON processing exception [`#2271`](https://github.com/MarquezProject/marquez/pull/2271)[@pawel-big-lebowski](https://github.com/pawel-big-lebowski)
14
+
*In case of JSON processing exceptions Marquez API should return exception message to a client.*
15
+
* Fix column lineage when multiple jobs write to same dataset [`#2289`](https://github.com/MarquezProject/marquez/pull/2289)[@pawel-big-lebowski](https://github.com/pawel-big-lebowski)
16
+
*The fix deprecates the way fields `transformationDescription` and `transformationType` are returned. The depracated way of returning those fields will be removed in 0.30.0.*
* Optimize current runs query for lineage API [`#2211`](https://github.com/MarquezProject/marquez/pull/2211)[@prachim-collab](https://github.com/prachim-collab)
23
+
*Add a simpler, alternate `getCurrentRuns` query that gets only simple runs from the database without the additional data from tables such as `run_args`, `job_context`, `facets`, etc., which required extra table joins.*
24
+
* Add Code Quality, DCO and Governance docs to project [`#2237`](https://github.com/MarquezProject/marquez/pull/2237)[`#2241`](https://github.com/MarquezProject/marquez/pull/2241)[@merobi-hub](https://github.com/MarquezProject/marquez/commits?author=merobi-hub)
25
+
*Adds a number of standard governance and procedure docs to the project.*
26
+
* Add possibility to soft-delete namespaces [`#2244`](https://github.com/MarquezProject/marquez/pull/2244)[@mobuchowski](https://github.com/mobuchowski)
27
+
*Adds the ability to "hide" inactive namespaces. The namespaces are undeleted when a relevant OL event is received.*
28
+
* Add search service proposal [`#2203`](https://github.com/MarquezProject/marquez/pull/2203)[@pawel-big-lebowski](https://github.com/pawel-big-lebowski)
29
+
*Proposes using ElasticSearch as a pluggable search service to enhance the search feature in Marquez and adding the ability to turn it off, as well. Includes ideas about what should be indexed and the requirements for the interface.*
30
+
31
+
### Fixed
32
+
33
+
* Show facets even when dataset has no fields [`#2214`](https://github.com/MarquezProject/marquez/pull/2214)[@JDarDagran](https://github.com/JDarDagran)
34
+
*Changes the logic in the `DatasetInfo` component to always show facets so that dataset facets are visible in the UI even if no dataset fields have been set.*
35
+
* Appreciate column prefix when given for `ended_at`[`#2231`](https://github.com/MarquezProject/marquez/pull/2231)[@fm100](https://github.com/fm100)
36
+
*The `ended_at` column was always null when querying if `columnPrefix` was given for the mapper. Now, `columnPrefix` is included when checking for column existence.*
37
+
* Fix bug keeping jobs from being properly deleted [`#2244`](https://github.com/MarquezProject/marquez/pull/2244)[@mobuchowski](https://github.com/mobuchowski)
38
+
*It wasn't possible to delete jobs created from events that had a `ParentRunFacet`. Now it's possible.*
*The dataset's name column in the `dataset_symlinks` table was shorter than the column in the datasets table. Changes the existing V48 migration script to allow proper migration for users who did not upgrade yet, and adds an extra migration script to extend the column length for users who did upgrade but did not experience the issues.*
* Add support for `parentRun` facet as reported by older Airflow OpenLineage versions [`#2130`](https://github.com/MarquezProject/marquez/pull/2130)[@collado-mike](https://github.com/collado-mike)
44
82
*Adds a `parentRun` alias to the `LineageEvent``RunFacet`.*
45
-
* Add fix and tests for handling Airflow DAGs with dots and task groups [`2126`](https://github.com/MarquezProject/marquez/pull/2126)[@collado-mike](https://github.com/collado-mike)[@wslulciuc](https://github.com/wslulciuc)
83
+
* Add fix and tests for handling Airflow DAGs with dots and task groups [`#2126`](https://github.com/MarquezProject/marquez/pull/2126)[@collado-mike](https://github.com/collado-mike)[@wslulciuc](https://github.com/wslulciuc)
46
84
*Fixes a recent change that broke how Marquez handles DAGs with dots and tasks within task groups and adds test cases to validate.*
47
-
* Fix version bump in `docker/up.sh`[`2129`](https://github.com/MarquezProject/marquez/pull/2129)[@wslulciuc](https://github.com/wslulciuc)
85
+
* Fix version bump in `docker/up.sh`[`#2129`](https://github.com/MarquezProject/marquez/pull/2129)[@wslulciuc](https://github.com/wslulciuc)
48
86
*Defines a `VERSION` variable to bump on a release.*
49
-
* Use `clean` when running `shadowJar` in Dockerfile [`2145`](https://github.com/MarquezProject/marquez/pull/2145)[@wslulciuc](https://github.com/wslulciuc)
87
+
* Use `clean` when running `shadowJar` in Dockerfile [`#2145`](https://github.com/MarquezProject/marquez/pull/2145)[@wslulciuc](https://github.com/wslulciuc)
50
88
*Ensures the directory `api/build/libs/` is cleaned before building the JAR again and updates `.dockerignore` to ignore `api/build/*`.*
51
89
* Fix bug that caused a single run event to create multiple jobs [`#2162`](https://github.com/MarquezProject/marquez/pull/2162)[@collado-mike](https://github.com/collado-mike)
52
90
*Checks to see if a run with the given ID already exists and uses the pre-associated job if so.*
The authors of Marquez are committed to providing secure software of the highest quality possible. To this end, we employ a number of tools and methodologies to ensure that our design, build, maintenance and testing practices maximize efficiency and minimize risk.
4
+
5
+
The specific security and analysis methodologies that we employ include but are not limited to:
6
+
7
+
## Security
8
+
9
+
- Participation in the [OpenSSF Best Practices Badge Program](https://bestpractices.coreinfrastructure.org/en/projects/5106) for Free/Libre and FLOSS projects to ensure that we follow current best practices for quality and security
10
+
- Use of [HTTPS](https://en.wikipedia.org/wiki/HTTPS) for network communication
11
+
- Support for multiple cryptographic algorithms (through the use of HTTPS)
12
+
- Separate storage of authentication credentials according to best practices
13
+
- Use of secure protocols for network communication (through the use of HTTPS)
14
+
- Up-to-date support for TLS/SSL (through the use of [OpenSSL](https://www.openssl.org/))
15
+
- Performance of TLS certificate verification by default before sending HTTP headers with private information (through the use of OpenSSL and HTTPS)
16
+
- Distribution of the software via cryptographically signed releases (on the [PyPI](https://pypi.org/) and [Maven](https://mvnrepository.com/) package repositories)
17
+
- Use of [GitHub](https://github.com/) Issues for vulnerability reporting and tracking
18
+
19
+
## Analysis
20
+
21
+
- Use of [PMD](https://pmd.github.io/) and [Spotless](https://github.com/diffplug/spotless) for Java code linting on pull requests and builds
22
+
- Use of [Flake8](https://flake8.pycqa.org/en/latest/) and [Pytest](https://docs.pytest.org/en/7.2.x/) for Python code linting on pull requests and builds
23
+
- Use of GitHub Issues for bug reporting and tracking
24
+
25
+
## Contact
26
+
27
+
For more information about our approach to quality and security, feel free to reach out to the Marquez development team:
A Contributor may become a Committer by a majority approval of the
38
-
existing Committers. (per the project [charter](https://wiki.lfaidata.foundation/download/attachments/18481434/Marquez%20Project%20Technical%20Charter%20Final_Adopted%2005.21.20.pdf?version=1&modificationDate=1591718661000&api=v2))
37
+
A Contributor may become a Committer by the approval of a majority of the
38
+
existing Committers (as per the project [charter](https://wiki.lfaidata.foundation/download/attachments/18481434/Marquez%20Project%20Technical%20Charter%20Final_Adopted%2005.21.20.pdf?version=1&modificationDate=1591718661000&api=v2)).
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+43-43Lines changed: 43 additions & 43 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -46,7 +46,7 @@ We use [spotless](https://github.com/diffplug/spotless) to format our code. This
46
46
$ ./gradlew spotlessApply
47
47
```
48
48
49
-
> **Note:** To make formatting code simple, we recommend installing a [plugin](https://github.com/google/google-java-format#intellij-android-studio-and-other-jetbrains-ides) for your favorite IDE. We also us[Lombok](https://projectlombok.org). Though not required, you might want to install the [plugin](https://projectlombok.org/setup/overview) as well.
49
+
> **Note:** To make formatting code simple, we recommend installing a [plugin](https://github.com/google/google-java-format#intellij-android-studio-and-other-jetbrains-ides) for your favorite IDE. We also use[Lombok](https://projectlombok.org). Though not required, you might want to install the [plugin](https://projectlombok.org/setup/overview), as well.
1.[Fork](https://github.com/MarquezProject/marquez/fork) and clone the repository
123
123
2. Make sure all tests pass locally: `./gradlew :api:test`
124
124
3. Create a new [branch](#branching): `git checkout -b feature/my-cool-new-feature`
125
-
4. Make change on your cool new branch
125
+
4. Make a change on your cool new branch
126
126
5. Write a test for your change
127
-
6. Make sure `.java` files are formatted: `./gradlew spotlessJavaCheck`
127
+
6. Make sure `.java` files are formatted: `./gradlew spotlessJavaCheck`
128
128
7. Make sure `.java` files contain a [copyright and license header](#copyright--license)
129
129
8. Make sure to [sign you work](#sign-your-work)
130
-
9. Push change to your fork and [submit a pull request](https://github.com/MarquezProject/marquez/compare)
130
+
9. Push the change to your fork and [submit a pull request](https://github.com/MarquezProject/marquez/compare)
131
131
10. Work with project maintainers to get your change reviewed and merged into the `main` branch
132
132
11. Delete your branch
133
133
@@ -137,17 +137,17 @@ To ensure your pull request is accepted, follow these guidelines:
137
137
* Do your best to have a [well-formed commit message](https://tbaggery.com/2008/04/19/a-note-about-git-commit-messages.html) for your change
138
138
*[Keep diffs small](https://kurtisnusbaum.medium.com/stacked-diffs-keeping-phabricator-diffs-small-d9964f4dcfa6) and self-contained
139
139
* If your change fixes a bug, please [link the issue](https://help.github.com/articles/closing-issues-using-keywords) in your pull request description
140
-
* Any changes to the API reference requires[regenerating](#api-docs) the static `openapi.html` file.
140
+
* Any changes to the API reference require[regenerating](#api-docs) the static `openapi.html` file.
141
141
142
142
> **Note:** A pull request should generally contain only one commit (use `git commit --amend` and `git push --force` or [squash](http://gitready.com/advanced/2009/02/10/squashing-commits-with-rebase.html) existing commits into one).
143
143
144
144
# Branching
145
145
146
-
* Use a _group_ at the beginning of your branch names
146
+
* Use a _group_ at the beginning of your branch names:
147
147
148
148
```
149
149
feature Add or expand a feature
150
-
bug Fix a bug
150
+
bug Fix a bug
151
151
proposal Propose a change
152
152
```
153
153
@@ -156,7 +156,7 @@ To ensure your pull request is accepted, follow these guidelines:
156
156
```
157
157
feature/my-cool-new-feature
158
158
bug/my-bug-fix
159
-
bug/my-other-bug-fix
159
+
bug/my-other-bug-fix
160
160
proposal/my-proposal
161
161
```
162
162
@@ -167,18 +167,18 @@ To ensure your pull request is accepted, follow these guidelines:
167
167
# Dependencies
168
168
169
169
We use [renovate](https://github.com/renovatebot/renovate) to manage dependencies for most of our project modules,
170
-
with a couple of exceptions. Renovate automatically detects new dependency versions, and opens pull
171
-
requests to upgrade dependencies in accordance to the [configured rules](https://github.com/MarquezProject/marquez/blob/main/renovate.json).
170
+
with a couple of exceptions. Renovate automatically detects new dependency versions and opens pull
171
+
requests to upgrade dependencies in accordance with the [configured rules](https://github.com/MarquezProject/marquez/blob/main/renovate.json).
172
172
173
-
The following dependencies are managed manually
173
+
The following dependencies are managed manually:
174
174
175
175
*_Web code_ - it is challenging to programmatically validate web content
176
176
*_Spark versions_ - the internal query plans parsed by the Spark OpenLineage integration are not stable across Spark versions
177
177
*_Gradle_ - this tool orchestrates the entire build pipeline and was excluded to ensure stability
178
178
179
179
# Sign Your Work
180
180
181
-
The _sign-off_ is a simple line at the end of the message for a commit. All commits needs to be signed. Your signature certifies that you wrote the patch or otherwise have the right to contribute the material (see [Developer Certificate of Origin](https://developercertificate.org)):
181
+
The _sign-off_ is a simple line at the end of the message for a commit. All commits need to be signed. Your signature certifies that you wrote the patch or otherwise have the right to contribute the material (see [Developer Certificate of Origin](https://developercertificate.org)):
> **Note:** To bundle or serve the API docs, please install [redoc-cli](https://www.npmjs.com/package/redoc-cli).
212
-
213
-
# `COPYRIGHT` / `LICENSE`
214
-
215
-
We use [SPDX](https://spdx.dev) for copyright and license information. The following license header **must** be included in all `java,``bash`, and `py` source files:
216
-
217
-
`java`
218
-
219
-
```
220
-
/*
221
-
* Copyright 2018-2022 contributors to the Marquez project
222
-
* SPDX-License-Identifier: Apache-2.0
223
-
*/
224
-
```
225
-
226
-
`bash`
227
-
228
-
```
229
-
#!/bin/bash
230
-
#
231
-
# Copyright 2018-2022 contributors to the Marquez project
232
-
# SPDX-License-Identifier: Apache-2.0
233
-
```
234
-
235
-
`py`
236
-
237
-
```
238
-
# Copyright 2018-2022 contributors to the Marquez project
239
-
# SPDX-License-Identifier: Apache-2.0
211
+
> **Note:** To bundle or serve the API docs, please install [redoc-cli](https://www.npmjs.com/package/redoc-cli).
212
+
213
+
# `COPYRIGHT` / `LICENSE`
214
+
215
+
We use [SPDX](https://spdx.dev) for copyright and license information. The following license header **must** be included in all `java,``bash`, and `py` source files:
216
+
217
+
`java`
218
+
219
+
```
220
+
/*
221
+
* Copyright 2018-2022 contributors to the Marquez project
222
+
* SPDX-License-Identifier: Apache-2.0
223
+
*/
224
+
```
225
+
226
+
`bash`
227
+
228
+
```
229
+
#!/bin/bash
230
+
#
231
+
# Copyright 2018-2022 contributors to the Marquez project
232
+
# SPDX-License-Identifier: Apache-2.0
233
+
```
234
+
235
+
`py`
236
+
237
+
```
238
+
# Copyright 2018-2022 contributors to the Marquez project
239
+
# SPDX-License-Identifier: Apache-2.0
240
240
```
241
241
242
242
# Resources
243
243
244
244
*[How to Contribute to Open Source](https://opensource.guide/how-to-contribute)
245
245
*[Using the Fork-and-Branch Git Workflow](https://blog.scottlowe.org/2015/01/27/using-fork-branch-git-workflow)
246
246
*[Understanding the GitHub flow](https://guides.github.com/introduction/flow/)
247
-
*[Keep a Changelog](https://keepachangelog.com)
247
+
*[Keeping a Changelog](https://keepachangelog.com)
0 commit comments