Adding a Model
This document describes how to add a model in TensorRT-LLM.
TensorRT-LLM provides:
- Low-level functions, for example, concat, add, and sum.
- Basic layers, such as Linear and LayerNorm.
- High-level layers, such as MLP and Attention.
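The relationship between the three tiers can be illustrated with a toy sketch in plain Python. This is not the TensorRT-LLM API: the names below only mirror the ones listed above, the classes operate on ordinary lists of floats, and the dot and relu helpers and the choice of ReLU are assumptions made for the illustration.

```python
# Toy illustration in plain Python; none of this is the TensorRT-LLM API.

# Tier 1: low-level functions operating on plain lists of floats.
def add(a, b):
    return [x + y for x, y in zip(a, b)]


def dot(a, b):  # hypothetical helper, not named in this document
    return sum(x * y for x, y in zip(a, b))


def relu(a):  # hypothetical helper, not named in this document
    return [max(0.0, x) for x in a]


# Tier 2: a basic layer built from low-level functions.
class Linear:
    def __init__(self, weight, bias):
        self.weight = weight  # list of rows, one row per output feature
        self.bias = bias

    def __call__(self, x):
        return add([dot(row, x) for row in self.weight], self.bias)


# Tier 3: a high-level layer composed from basic layers.
class MLP:
    def __init__(self, fc, proj):
        self.fc = fc
        self.proj = proj

    def __call__(self, x):
        return self.proj(relu(self.fc(x)))


mlp = MLP(
    fc=Linear([[1.0, 0.0], [0.0, -1.0]], [0.0, 0.0]),
    proj=Linear([[1.0, 1.0]], [0.5]),
)
print(mlp([2.0, 3.0]))  # prints [2.5]
```

A model written only against the first two tiers would implement something like MLP itself, while a model using the high-level tier would reuse it directly.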
Steps
- Create a model directory in tensorrt_llm/tensorrt_llm/models, for example bloom.
- Write a model.py with TensorRT-LLM low-level functions and basic layers. Using high-level layers is optional.
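The two steps above can be sketched as a small scaffolding script. This is a minimal sketch, not part of TensorRT-LLM: scaffold_model and MODEL_STUB are hypothetical names, the stub contents are a placeholder, and the demo runs in a temporary directory rather than in a real checkout.

```python
import tempfile
from pathlib import Path

# Hypothetical placeholder contents; the real model.py defines the network.
MODEL_STUB = '''"""Skeleton for a new model definition."""

# Build the network here from TensorRT-LLM low-level functions and basic
# layers. High-level layers are optional.
'''


def scaffold_model(models_dir: Path, model_name: str) -> Path:
    """Create <models_dir>/<model_name>/model.py and return its path."""
    model_dir = models_dir / model_name
    model_dir.mkdir(parents=True, exist_ok=True)
    model_file = model_dir / "model.py"
    if not model_file.exists():  # never overwrite an existing model.py
        model_file.write_text(MODEL_STUB)
    return model_file


if __name__ == "__main__":
    # Demo in a throwaway directory that stands in for the models directory.
    with tempfile.TemporaryDirectory() as tmp:
        created = scaffold_model(Path(tmp) / "models", "bloom")
        print(created.relative_to(tmp).as_posix())  # prints models/bloom/model.py
```

In a real checkout, models_dir would point at the models directory named in the first step.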