What is Model Inference?

Question

Accepted Answer

The process of running a trained AI model to get a response — what happens every single time you send a message. 'Inference' is just the technical word for 'using the model.' Training is the expensive, one-time process of teaching the model. Inference is the ongoing, per-request process of getting answers out of it. When you're paying API costs, you're paying for inference.