mirror of
https://github.com/x1xhlol/system-prompts-and-models-of-ai-tools.git
synced 2026-06-18 07:19:35 +00:00
Railway build was failing with "Image of size 5.7 GB exceeded limit of 4.0 GB" because sentence-transformers pulled torch with full CUDA/NVIDIA GPU packages (~3 GB). Fix: multi-stage Dockerfile that: 1. Installs CPU-only torch first (--index-url pytorch.org/whl/cpu) saving ~3 GB (200 MB CPU vs 3.2 GB CUDA) 2. Multi-stage build: builder + runtime (smaller final image) 3. Non-root user (app:1000) 4. tini init for proper signal handling 5. Built-in HEALTHCHECK with 60s start-period 6. railway.toml with healthcheck path and restart policy Also fixes healthcheck failure: start-period=60s gives the app time to initialize before Railway starts checking /health. Expected image size: ~2 GB (down from 5.7 GB). https://claude.ai/code/session_01W1rJthWDkasijTdXCfxVHs
47 lines
1.5 KiB
Docker
47 lines
1.5 KiB
Docker
# ── Stage 1: Builder ──────────────────────────────────
|
|
FROM python:3.12-slim AS builder
|
|
|
|
RUN apt-get update && apt-get install -y --no-install-recommends \
|
|
build-essential libpq-dev curl \
|
|
&& rm -rf /var/lib/apt/lists/*
|
|
|
|
WORKDIR /build
|
|
|
|
RUN python -m venv /opt/venv
|
|
ENV PATH="/opt/venv/bin:$PATH"
|
|
|
|
COPY requirements.txt ./
|
|
|
|
# Install CPU-only torch first (saves ~3 GB vs CUDA version)
|
|
RUN pip install --no-cache-dir --upgrade pip setuptools wheel \
|
|
&& pip install --no-cache-dir torch --index-url https://download.pytorch.org/whl/cpu \
|
|
&& pip install --no-cache-dir -r requirements.txt
|
|
|
|
# ── Stage 2: Runtime ─────────────────────────────────
|
|
FROM python:3.12-slim AS runtime
|
|
|
|
RUN apt-get update && apt-get install -y --no-install-recommends \
|
|
libpq5 curl tini \
|
|
&& rm -rf /var/lib/apt/lists/*
|
|
|
|
RUN groupadd --gid 1000 app \
|
|
&& useradd --uid 1000 --gid app --shell /bin/bash --create-home app
|
|
|
|
COPY --from=builder /opt/venv /opt/venv
|
|
ENV PATH="/opt/venv/bin:$PATH" \
|
|
PYTHONUNBUFFERED=1 \
|
|
PYTHONDONTWRITEBYTECODE=1
|
|
|
|
WORKDIR /app
|
|
COPY --chown=app:app . .
|
|
|
|
USER app
|
|
|
|
EXPOSE 8000
|
|
|
|
HEALTHCHECK --interval=30s --timeout=10s --start-period=60s --retries=3 \
|
|
CMD curl -f http://localhost:8000/api/v1/health || exit 1
|
|
|
|
ENTRYPOINT ["tini", "--"]
|
|
CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8000", "--workers", "2"]
|