immich/machine-learning/app/main.py

import asyncio
import logging
import os
from concurrent.futures import ThreadPoolExecutor
from typing import Any

import orjson
import uvicorn
from fastapi import FastAPI, Form, HTTPException, UploadFile
from fastapi.responses import ORJSONResponse
from starlette.formparsers import MultiPartParser

from app.models.base import InferenceModel

from .config import log, settings
from .models.cache import ModelCache
from .schemas import (
    MessageResponse,
    ModelType,
    TextResponse,
)

MultiPartParser.max_file_size = 2**24  # spools to disk if payload is 16 MiB or larger
app = FastAPI()


def init_state() -> None:
    app.state.model_cache = ModelCache(ttl=settings.model_ttl, revalidate=settings.model_ttl > 0)
    log.info(
        (
            "Created in-memory cache with unloading "
            f"{f'after {settings.model_ttl}s of inactivity' if settings.model_ttl > 0 else 'disabled'}."
        )
    )
    # asyncio is a huge bottleneck for performance, so we use a thread pool to run blocking code
    app.state.thread_pool = ThreadPoolExecutor(settings.request_threads)
    log.info(f"Initialized request thread pool with {settings.request_threads} threads.")


@app.on_event("startup")
async def startup_event() -> None:
    init_state()


@app.get("/", response_model=MessageResponse)
async def root() -> dict[str, str]:
    return {"message": "Immich ML"}


@app.get("/ping", response_model=TextResponse)
def ping() -> str:
    return "pong"


@app.post("/predict")
async def predict(
    model_name: str = Form(alias="modelName"),
    model_type: ModelType = Form(alias="modelType"),
    options: str = Form(default="{}"),
    text: str | None = Form(default=None),
    image: UploadFile | None = None,
) -> Any:
    if image is not None:
        inputs: str | bytes = await image.read()
    elif text is not None:
        inputs = text
    else:
        raise HTTPException(400, "Either image or text must be provided")

    model: InferenceModel = await app.state.model_cache.get(model_name, model_type, **orjson.loads(options))
    outputs = await run(model, inputs)
    return ORJSONResponse(outputs)


async def run(model: InferenceModel, inputs: Any) -> Any:
    return await asyncio.get_running_loop().run_in_executor(app.state.thread_pool, model.predict, inputs)


if __name__ == "__main__":
    is_dev = os.getenv("NODE_ENV") == "development"
    uvicorn.run(
        "app.main:app",
        host=settings.host,
        port=settings.port,
        reload=is_dev,
        workers=settings.workers,
        log_config=None,
        access_log=log.isEnabledFor(logging.INFO),
    )
feat(ml)!: switch image classification and CLIP models to ONNX (#3809) 2023-08-25 04:28:51 +00:00			`import asyncio`
chore(ml): improved logging (#3918) * fixed `minScore` not being set correctly * apply to init * don't send `enabled` * fix eslint warning * added logger * added logging * refinements * enable access log for info level * formatting * merged strings --------- Co-authored-by: Alex <alex.tran1502@gmail.com> 2023-08-30 08:22:01 +00:00			`import logging`
feat: facial recognition (#2180) 2023-05-17 17:07:17 +00:00			`import os`
feat(ml)!: switch image classification and CLIP models to ONNX (#3809) 2023-08-25 04:28:51 +00:00			`from concurrent.futures import ThreadPoolExecutor`
chore(ml): updated dockerfile, added typing, packaging (#2642) * updated dockerfile, added typing, packaging apply env change * added arm64 support * added ml version pump, second try for arm64 * added linting config to pyproject.toml * renamed ml input field * fixed linter config * fixed dev docker compose 2023-06-05 14:40:48 +00:00			`from typing import Any`
feat(ml): model unloading (#2661) * model cache * fixed revalidation when using cache namespace * fixed ttl not being set, added lock 2023-06-07 01:48:51 +00:00
feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`import orjson`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00			`import uvicorn`
feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`from fastapi import FastAPI, Form, HTTPException, UploadFile`
			`from fastapi.responses import ORJSONResponse`
			`from starlette.formparsers import MultiPartParser`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00
feat(ml)!: switch image classification and CLIP models to ONNX (#3809) 2023-08-25 04:28:51 +00:00			`from app.models.base import InferenceModel`

chore(ml): improved logging (#3918) * fixed `minScore` not being set correctly * apply to init * don't send `enabled` * fix eslint warning * added logger * added logging * refinements * enable access log for info level * formatting * merged strings --------- Co-authored-by: Alex <alex.tran1502@gmail.com> 2023-08-30 08:22:01 +00:00			`from .config import log, settings`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00			`from .models.cache import ModelCache`
			`from .schemas import (`
chore(ml): updated dockerfile, added typing, packaging (#2642) * updated dockerfile, added typing, packaging apply env change * added arm64 support * added ml version pump, second try for arm64 * added linting config to pyproject.toml * renamed ml input field * fixed linter config * fixed dev docker compose 2023-06-05 14:40:48 +00:00			`MessageResponse,`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00			`ModelType,`
chore(ml): updated dockerfile, added typing, packaging (#2642) * updated dockerfile, added typing, packaging apply env change * added arm64 support * added ml version pump, second try for arm64 * added linting config to pyproject.toml * renamed ml input field * fixed linter config * fixed dev docker compose 2023-06-05 14:40:48 +00:00			`TextResponse,`
			`)`
feat(ml) backend takes image over HTTP (#2783) * using pydantic BaseSetting * ML API takes image file as input * keeping image in memory * reducing duplicate code * using bytes instead of UploadFile & other small code improvements * removed form-multipart, using HTTP body * format code --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-18 03:49:19 +00:00
feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`MultiPartParser.max_file_size = 2**24 # spools to disk if payload is 16 MiB or larger`
feat: facial recognition (#2180) 2023-05-17 17:07:17 +00:00			`app = FastAPI()`

chore(ml): move to fastAPI (#2336) 2023-04-26 10:39:24 +00:00
chore(ml): added testing and github workflow (#2969) * added testing * github action for python, made mypy happy * formatted with black * minor fixes and styling * test model cache * cache test dependencies * narrowed model cache tests * moved endpoint tests to their own class * cleaned up fixtures * formatting * removed unused dep 2023-06-27 23:21:33 +00:00			`def init_state() -> None:`
fix(ml): race condition when loading models (#3207) * sync model loading, disabled model ttl by default * disable revalidation if model unloading disabled * moved lock 2023-07-11 17:01:21 +00:00			`app.state.model_cache = ModelCache(ttl=settings.model_ttl, revalidate=settings.model_ttl > 0)`
chore(ml): improved logging (#3918) * fixed `minScore` not being set correctly * apply to init * don't send `enabled` * fix eslint warning * added logger * added logging * refinements * enable access log for info level * formatting * merged strings --------- Co-authored-by: Alex <alex.tran1502@gmail.com> 2023-08-30 08:22:01 +00:00			`log.info(`
			`(`
			`"Created in-memory cache with unloading "`
			`f"{f'after {settings.model_ttl}s of inactivity' if settings.model_ttl > 0 else 'disabled'}."`
			`)`
			`)`
feat(ml)!: switch image classification and CLIP models to ONNX (#3809) 2023-08-25 04:28:51 +00:00			`# asyncio is a huge bottleneck for performance, so we use a thread pool to run blocking code`
			`app.state.thread_pool = ThreadPoolExecutor(settings.request_threads)`
chore(ml): improved logging (#3918) * fixed `minScore` not being set correctly * apply to init * don't send `enabled` * fix eslint warning * added logger * added logging * refinements * enable access log for info level * formatting * merged strings --------- Co-authored-by: Alex <alex.tran1502@gmail.com> 2023-08-30 08:22:01 +00:00			`log.info(f"Initialized request thread pool with {settings.request_threads} threads.")`
chore(ml): added testing and github workflow (#2969) * added testing * github action for python, made mypy happy * formatted with black * minor fixes and styling * test model cache * cache test dependencies * narrowed model cache tests * moved endpoint tests to their own class * cleaned up fixtures * formatting * removed unused dep 2023-06-27 23:21:33 +00:00

			`@app.on_event("startup")`
			`async def startup_event() -> None:`
			`init_state()`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00
feat(ml) backend takes image over HTTP (#2783) * using pydantic BaseSetting * ML API takes image file as input * keeping image in memory * reducing duplicate code * using bytes instead of UploadFile & other small code improvements * removed form-multipart, using HTTP body * format code --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-18 03:49:19 +00:00
chore(ml): updated dockerfile, added typing, packaging (#2642) * updated dockerfile, added typing, packaging apply env change * added arm64 support * added ml version pump, second try for arm64 * added linting config to pyproject.toml * renamed ml input field * fixed linter config * fixed dev docker compose 2023-06-05 14:40:48 +00:00			`@app.get("/", response_model=MessageResponse)`
			`async def root() -> dict[str, str]:`
chore(ml): move to fastAPI (#2336) 2023-04-26 10:39:24 +00:00			`return {"message": "Immich ML"}`


chore(ml): updated dockerfile, added typing, packaging (#2642) * updated dockerfile, added typing, packaging apply env change * added arm64 support * added ml version pump, second try for arm64 * added linting config to pyproject.toml * renamed ml input field * fixed linter config * fixed dev docker compose 2023-06-05 14:40:48 +00:00			`@app.get("/ping", response_model=TextResponse)`
			`def ping() -> str:`
feat(machine-learning)!: move machine learning to Python based image (#1774) BREAKING CHANGES * Users have to update the docker-compose file, machine-learning portion. * Temporary dropping machine-learning support for Arm64 and Armv7 2023-02-18 15:13:37 +00:00			`return "pong"`

feat(ml): env variables for tags, faces and eager startup (#2626) * env variables for tags, faces and eager startup * chore(server,ml): remove object detection job and endpoint (#2627) * removed object detection job * removed object detection endpoint * env variables for tags, faces and eager startup * download without caching models if not eager * simplified `get_cached_model` * re-added env for clip text model 2023-06-03 02:42:47 +00:00
feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`@app.post("/predict")`
			`async def predict(`
			`model_name: str = Form(alias="modelName"),`
			`model_type: ModelType = Form(alias="modelType"),`
			`options: str = Form(default="{}"),`
			`text: str \| None = Form(default=None),`
			`image: UploadFile \| None = None,`
			`) -> Any:`
			`if image is not None:`
			`inputs: str \| bytes = await image.read()`
			`elif text is not None:`
			`inputs = text`
			`else:`
			`raise HTTPException(400, "Either image or text must be provided")`
feat(machine-learning)!: move machine learning to Python based image (#1774) BREAKING CHANGES * Users have to update the docker-compose file, machine-learning portion. * Temporary dropping machine-learning support for Arm64 and Armv7 2023-02-18 15:13:37 +00:00
feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`model: InferenceModel = await app.state.model_cache.get(model_name, model_type, **orjson.loads(options))`
			`outputs = await run(model, inputs)`
			`return ORJSONResponse(outputs)`
feat(ml): env variables for tags, faces and eager startup (#2626) * env variables for tags, faces and eager startup * chore(server,ml): remove object detection job and endpoint (#2627) * removed object detection job * removed object detection endpoint * env variables for tags, faces and eager startup * download without caching models if not eager * simplified `get_cached_model` * re-added env for clip text model 2023-06-03 02:42:47 +00:00

feat(ml)!: customizable ML settings (#3891) * consolidated endpoints, added live configuration * added ml settings to server * added settings dashboard * updated deps, fixed typos * simplified modelconfig updated tests * Added ml setting accordion for admin page updated tests * merge `clipText` and `clipVision` * added face distance setting clarified setting * add clip mode in request, dropdown for face models * polished ml settings updated descriptions * update clip field on error * removed unused import * add description for image classification threshold * pin safetensors for arm wheel updated poetry lock * moved dto * set model type only in ml repository * revert form-data package install use fetch instead of axios * added slotted description with link updated facial recognition description clarified effect of disabling tasks * validation before model load * removed unnecessary getconfig call * added migration * updated api updated api updated api --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-08-29 13:58:00 +00:00			`async def run(model: InferenceModel, inputs: Any) -> Any:`
feat(ml)!: switch image classification and CLIP models to ONNX (#3809) 2023-08-25 04:28:51 +00:00			`return await asyncio.get_running_loop().run_in_executor(app.state.thread_pool, model.predict, inputs)`


feat(machine-learning)!: move machine learning to Python based image (#1774) BREAKING CHANGES * Users have to update the docker-compose file, machine-learning portion. * Temporary dropping machine-learning support for Arm64 and Armv7 2023-02-18 15:13:37 +00:00			`if __name__ == "__main__":`
chore(ml): load models on start up (#2487) * chore(ml): load models on start up * Download correct model 2023-05-20 03:37:01 +00:00			`is_dev = os.getenv("NODE_ENV") == "development"`
feat(ml) backend takes image over HTTP (#2783) * using pydantic BaseSetting * ML API takes image file as input * keeping image in memory * reducing duplicate code * using bytes instead of UploadFile & other small code improvements * removed form-multipart, using HTTP body * format code --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-18 03:49:19 +00:00			`uvicorn.run(`
refactor(ml): modularization and styling (#2835) * basic refactor and styling * removed batching * module entrypoint * removed unused imports * model superclass, model cache now in app state * fixed cache dir and enforced abstract method --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-25 03:18:09 +00:00			`"app.main:app",`
feat(ml) backend takes image over HTTP (#2783) * using pydantic BaseSetting * ML API takes image file as input * keeping image in memory * reducing duplicate code * using bytes instead of UploadFile & other small code improvements * removed form-multipart, using HTTP body * format code --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-18 03:49:19 +00:00			`host=settings.host,`
			`port=settings.port,`
			`reload=is_dev,`
			`workers=settings.workers,`
chore(ml): improved logging (#3918) * fixed `minScore` not being set correctly * apply to init * don't send `enabled` * fix eslint warning * added logger * added logging * refinements * enable access log for info level * formatting * merged strings --------- Co-authored-by: Alex <alex.tran1502@gmail.com> 2023-08-30 08:22:01 +00:00			`log_config=None,`
			`access_log=log.isEnabledFor(logging.INFO),`
feat(ml) backend takes image over HTTP (#2783) * using pydantic BaseSetting * ML API takes image file as input * keeping image in memory * reducing duplicate code * using bytes instead of UploadFile & other small code improvements * removed form-multipart, using HTTP body * format code --------- Co-authored-by: Alex Tran <alex.tran1502@gmail.com> 2023-06-18 03:49:19 +00:00			`)`