Commit Graph

4 Commits

Author SHA1 Message Date
Claude
7370c59b19
fix: remove healthcheck from railway.toml — let app deploy without it
https://claude.ai/code/session_01W1rJthWDkasijTdXCfxVHs
2026-04-26 01:28:32 +00:00
Claude
565822c175
fix: remove Docker HEALTHCHECK + increase Railway timeout to 300s
Docker HEALTHCHECK conflicts with Railway's own healthcheck mechanism.

https://claude.ai/code/session_01W1rJthWDkasijTdXCfxVHs
2026-04-26 01:22:29 +00:00
Claude
e5aa1cade6
fix(railway): healthcheck path + dynamic PORT + start.sh wrapper
Railway healthcheck was failing because:
1. healthcheckPath was /api/v1/health but endpoint is /health
2. CMD used hardcoded port 8000, Railway injects $PORT dynamically
3. ${PORT:-8000} doesn't expand without shell

Fix:
- railway.toml: healthcheckPath = "/health"
- start.sh: shell wrapper that expands $PORT properly
- Dockerfile CMD: ["./start.sh"] instead of direct uvicorn
- HEALTHCHECK: uses /health (matches root endpoint)
- start-period increased to 90s for cold start

https://claude.ai/code/session_01W1rJthWDkasijTdXCfxVHs
2026-04-25 19:38:32 +00:00
Claude
dd8d2ccc9e
fix(docker): CPU-only torch + multi-stage build to fix Railway 4GB limit
Railway build was failing with "Image of size 5.7 GB exceeded limit of
4.0 GB" because sentence-transformers pulled torch with full CUDA/NVIDIA
GPU packages (~3 GB).

Fix: multi-stage Dockerfile that:
1. Installs CPU-only torch first (--index-url pytorch.org/whl/cpu)
   saving ~3 GB (200 MB CPU vs 3.2 GB CUDA)
2. Multi-stage build: builder + runtime (smaller final image)
3. Non-root user (app:1000)
4. tini init for proper signal handling
5. Built-in HEALTHCHECK with 60s start-period
6. railway.toml with healthcheck path and restart policy

Also fixes healthcheck failure: start-period=60s gives the app time
to initialize before Railway starts checking /health.

Expected image size: ~2 GB (down from 5.7 GB).

https://claude.ai/code/session_01W1rJthWDkasijTdXCfxVHs
2026-04-23 13:33:09 +00:00