Adding a Model
This document describes how to add a model in TensorRT-LLM.
TensorRT-LLM provides:
Low-level functions, for example,
concat
,add
, andsum
.Basic layers, such as,
Linear
andLayerNorm
.High-level layers, such as,
MLP
andAttention
.
Steps
Create a model directory in
tensorrt_llm/tensorrt_llm/models
, for examplebloom
.Write a
model.py
with TensorRT-LLM low level functions and basic layers. It’s optional to use high level layers.