Skip to content

Commit cbe4f7c

Browse files
zhengruifengrpnkv
authored andcommitted
[SPARK-55392][PYTHON][INFRA] Upgrade Python 3.14 test image to Ubuntu 24.04
### What changes were proposed in this pull request? Upgrade Python 3.14 test image to Ubuntu 24.04 ### Why are the changes needed? to test with a newer OS version ### Does this PR introduce _any_ user-facing change? no, infra-only ### How was this patch tested? PR builder with ``` default: '{"PYSPARK_IMAGE_TO_TEST": "python-314", "PYTHON_TO_TEST": "python3.14"}' ``` https://github.com/zhengruifeng/spark/actions/runs/21749581852/job/62744447326 ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#54175 from zhengruifeng/ubuntu_24_py_14. Authored-by: Ruifeng Zheng <[email protected]> Signed-off-by: Ruifeng Zheng <[email protected]>
1 parent d686581 commit cbe4f7c

File tree

1 file changed

+10
-7
lines changed

1 file changed

+10
-7
lines changed

dev/spark-test-image/python-314/Dockerfile

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -15,16 +15,16 @@
1515
# limitations under the License.
1616
#
1717

18-
# Image for building and testing Spark branches. Based on Ubuntu 22.04.
18+
# Image for building and testing Spark branches. Based on Ubuntu 24.04.
1919
# See also in https://hub.docker.com/_/ubuntu
20-
FROM ubuntu:jammy-20240911.1
20+
FROM ubuntu:noble
2121
LABEL org.opencontainers.image.authors="Apache Spark project <[email protected]>"
2222
LABEL org.opencontainers.image.licenses="Apache-2.0"
2323
LABEL org.opencontainers.image.ref.name="Apache Spark Infra Image For PySpark with Python 3.14"
2424
# Overwrite this label to avoid exposing the underlying Ubuntu OS version label
2525
LABEL org.opencontainers.image.version=""
2626

27-
ENV FULL_REFRESH_DATE=20260203
27+
ENV FULL_REFRESH_DATE=20260206
2828

2929
ENV DEBIAN_FRONTEND=noninteractive
3030
ENV DEBCONF_NONINTERACTIVE_SEEN=true
@@ -54,14 +54,17 @@ RUN apt-get update && apt-get install -y \
5454
&& apt-get clean \
5555
&& rm -rf /var/lib/apt/lists/*
5656

57+
# Setup virtual environment
58+
ENV VIRTUAL_ENV=/opt/spark-venv
59+
RUN python3.14 -m venv --without-pip $VIRTUAL_ENV
60+
ENV PATH="$VIRTUAL_ENV/bin:$PATH"
61+
62+
# Install Python 3.14 packages
63+
RUN curl -sS https://bootstrap.pypa.io/get-pip.py | python3.14
5764

5865
ARG BASIC_PIP_PKGS="numpy pyarrow>=22.0.0 six==1.16.0 pandas==2.3.3 scipy plotly<6.0.0 mlflow>=2.8.1 coverage matplotlib openpyxl memory-profiler>=0.61.0 scikit-learn>=1.3.2 pystack>=1.6.0 psutil"
59-
# Python deps for Spark Connect
6066
ARG CONNECT_PIP_PKGS="grpcio==1.76.0 grpcio-status==1.76.0 protobuf==6.33.5 googleapis-common-protos==1.71.0 zstandard==0.25.0 graphviz==0.20.3"
6167

62-
# Install Python 3.14 packages
63-
RUN curl -sS https://bootstrap.pypa.io/get-pip.py | python3.14
64-
RUN python3.14 -m pip install --ignore-installed 'blinker>=1.6.2' # mlflow needs this
6568
RUN python3.14 -m pip install $BASIC_PIP_PKGS unittest-xml-reporting $CONNECT_PIP_PKGS lxml && \
6669
python3.14 -m pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu && \
6770
python3.14 -m pip install torcheval && \

0 commit comments

Comments
 (0)