Commit 86c9cbf

tariq-hasan authored and tedhtchang committed
Support arm64 for Hugging Face trainer (kubeflow#2028)
* echoed disk usage before cleanup
  Signed-off-by: tariq-hasan <[email protected]>
* pruned docker images
  Signed-off-by: tariq-hasan <[email protected]>
* moved docker data directory
  Signed-off-by: tariq-hasan <[email protected]>
* added arm64 in the list of platforms for trainer-huggingface
  Signed-off-by: tariq-hasan <[email protected]>

---------

Signed-off-by: tariq-hasan <[email protected]>
1 parent b7aa507 commit 86c9cbf

2 files changed, +32 -4 lines changed

.github/workflows/publish-core-images.yaml (+1 -1)

@@ -34,4 +34,4 @@ jobs:
           - component-name: trainer-huggingface
             dockerfile: sdk/python/kubeflow/trainer/Dockerfile
             context: sdk/python/kubeflow/trainer
-            platforms: linux/amd64
+            platforms: linux/amd64,linux/arm64
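
The `platforms` value is a comma-separated list, which is the form `docker buildx build --platform` accepts. As a rough sketch of how such a value might be consumed, the script below splits the list and prints the build command it would correspond to. The image tag `example/trainer-huggingface:dev` is a hypothetical placeholder, and the command is only echoed, not executed; the real workflow may pass the value through a build action instead.

```shell
#!/usr/bin/env bash
set -eu

# Value taken from the workflow matrix after this commit.
PLATFORMS="linux/amd64,linux/arm64"

# Walk the comma-separated list and count the target platforms.
platform_count=0
for platform in $(printf '%s' "${PLATFORMS}" | tr ',' ' '); do
  echo "target platform: ${platform}"
  platform_count=$((platform_count + 1))
done

# Assumption: the tag below is illustrative only. The command is printed
# rather than run, so the sketch works without Docker installed.
echo "docker buildx build --platform ${PLATFORMS}" \
  "-f sdk/python/kubeflow/trainer/Dockerfile" \
  "-t example/trainer-huggingface:dev" \
  "sdk/python/kubeflow/trainer"
```

Building `linux/arm64` on an amd64 runner relies on emulation, which is why the publish action sets up QEMU before the build.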

.github/workflows/template-publish-image/action.yaml (+31 -3)

@@ -28,16 +28,44 @@ runs:
     - name: Remove unnecessary files
       shell: bash
       run: |
+        echo "Disk usage before cleanup:"
+        df -hT
+
         sudo rm -rf /usr/share/dotnet
         sudo rm -rf /opt/ghc
-        sudo rm -rf "/usr/local/share/boost"
+        sudo rm -rf /usr/local/share/boost
         sudo rm -rf "$AGENT_TOOLSDIRECTORY"
         sudo rm -rf /usr/local/lib/android
         sudo rm -rf /usr/local/share/powershell
         sudo rm -rf /usr/share/swift
-
+
         echo "Disk usage after cleanup:"
-        df -h
+        df -hT
+
+    - name: Prune docker images
+      shell: bash
+      run: |
+        docker image prune -a -f
+        docker system df
+        df -hT
+
+    - name: Move docker data directory
+      shell: bash
+      run: |
+        echo "Stopping docker service ..."
+        sudo systemctl stop docker
+        DOCKER_DEFAULT_ROOT_DIR=/var/lib/docker
+        DOCKER_ROOT_DIR=/mnt/docker
+        echo "Moving ${DOCKER_DEFAULT_ROOT_DIR} -> ${DOCKER_ROOT_DIR}"
+        sudo mv ${DOCKER_DEFAULT_ROOT_DIR} ${DOCKER_ROOT_DIR}
+        echo "Creating symlink ${DOCKER_DEFAULT_ROOT_DIR} -> ${DOCKER_ROOT_DIR}"
+        sudo ln -s ${DOCKER_ROOT_DIR} ${DOCKER_DEFAULT_ROOT_DIR}
+        echo "$(sudo ls -l ${DOCKER_DEFAULT_ROOT_DIR})"
+        echo "Starting docker service ..."
+        sudo systemctl daemon-reload
+        sudo systemctl start docker
+        echo "Docker service status:"
+        sudo systemctl --no-pager -l -o short status docker
 
     - name: Setup QEMU
       uses: docker/setup-qemu-action@v2
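
The "Move docker data directory" step boils down to a move followed by a symlink at the old path. One possible hardening, sketched below, is to quote the paths, skip the work when the source is already a symlink, and refuse to proceed when the destination exists (in that case `mv` would nest the source inside it). The helper name `relocate_dir` is hypothetical and not part of this commit. The demonstration runs on a temporary directory rather than `/var/lib/docker`, so it needs neither `sudo` nor `systemctl`.

```shell
#!/usr/bin/env bash
set -eu

# relocate_dir SRC DST: move SRC to DST and leave a symlink at SRC.
# Hypothetical helper for illustration; not taken from the commit.
relocate_dir() {
  src="$1"
  dst="$2"

  # Already relocated: nothing to do, so re-running the step is harmless.
  if [ -L "${src}" ]; then
    echo "${src} is already a symlink, skipping"
    return 0
  fi

  # An existing destination would make mv place SRC inside it.
  if [ -e "${dst}" ]; then
    echo "destination ${dst} already exists" >&2
    return 1
  fi

  mkdir -p "$(dirname "${dst}")"
  echo "Moving ${src} -> ${dst}"
  mv "${src}" "${dst}"
  echo "Creating symlink ${src} -> ${dst}"
  ln -s "${dst}" "${src}"
}

# Demonstration on throwaway paths that mirror the layout in the action.
workdir="$(mktemp -d)"
mkdir -p "${workdir}/var/lib/docker"
echo "layer" > "${workdir}/var/lib/docker/data.txt"

relocate_dir "${workdir}/var/lib/docker" "${workdir}/mnt/docker"
ls -l "${workdir}/var/lib/docker"
```

In the action itself, the call would still sit between `sudo systemctl stop docker` and `sudo systemctl start docker`, with `sudo` in front of `mv` and `ln`, since the Docker daemon must not be writing to the directory while it is moved.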
