cortex run

The lazy dev's way to run models. Does three things:

  1. cortex pull: Grabs the model if you don't have it
  2. cortex engines install: Sets up engines if missing
  3. cortex models start: Fires up the model
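
The three steps above can be sketched as a manual sequence. This is a rough illustration, not what cortex does internally. The subcommand names come from the list above, but the argument forms, the model name mistral, and the engine name llama-cpp are assumptions. The step helper is hypothetical and prints each command by default instead of running it. Unlike cortex run, this sketch does not check whether the model or engine is already present.

```shell
#!/bin/sh
# Manual equivalent of `cortex run`, one subcommand per step.
# MODEL and ENGINE are illustrative placeholders, not documented defaults.
MODEL="mistral"
ENGINE="llama-cpp"   # assumed engine name; check what your install offers

# Hypothetical helper: dry-run by default, printing each command instead of
# executing it. Set DRY_RUN=0 to actually invoke cortex.
step() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

manual_run() {
  step cortex pull "$MODEL"               # 1. grab the model
  step cortex engines install "$ENGINE"   # 2. set up the engine
  step cortex models start "$MODEL"       # 3. fire up the model
}

manual_run
```

With cortex run, the same sequence collapses into a single command, and the first two steps only happen when something is missing.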

Usage

Info: Need the gory details? Use the --verbose flag like this: cortex --verbose [subcommand]


cortex run [options] <model_id>

Options

| Option | Description | Required | Default value | Example |
|---|---|---|---|---|
| model_id | The identifier of the model you want to chat with. | Yes | - | mistral |
| --gpus | List of GPUs to use. | No | - | [0,1] |
| --ctx_len | Maximum context length for inference. | No | min(8192, max_model_context_length) | 1024 |
| -h, --help | Display help information for the command. | No | - | -h |
| -d, --detached | Load the model without starting an interactive chat. | No | - | -d |
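
The default for --ctx_len is the smaller of 8192 and the model's own maximum context length. Here is a minimal sketch of that rule. The function name default_ctx_len is made up for illustration, and the model's maximum is passed in by hand, since how cortex reads it from the model is not covered here.

```shell
#!/bin/sh
# Default --ctx_len per the options table: min(8192, max_model_context_length).
# default_ctx_len is a hypothetical helper, not part of the cortex CLI.
default_ctx_len() {
  max_model_ctx="$1"
  if [ "$max_model_ctx" -lt 8192 ]; then
    echo "$max_model_ctx"
  else
    echo 8192
  fi
}

default_ctx_len 4096     # prints 4096 (model maximum is below the cap)
default_ctx_len 32768    # prints 8192 (capped)
```

Passing --ctx_len explicitly, as in the table's example value of 1024, replaces this default.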