Optional
Audio tokens.
Document tokens. e.g. PDF
Image (non-video) tokens.
Text tokens. Does not need to be reported, but some models will do so.
Video tokens.
Audio tokens.