| | --- |
| | library_name: tensorrt-rtx |
| | license: apache-2.0 |
| | base_model: black-forest-labs/FLUX.1-dev |
| | tags: |
| | - tensorrt-rtx |
| | - flux1 |
| | - fp4 |
| | - dev |
| | - optimized |
| | inference: false |
| | --- |
| | |
| | # FLUX1 TensorRT-RTX: DEV-Fp4 π¨ Building |
| |
|
| | Optimized TensorRT-RTX engines for **FLUX1** on **Fp4** architecture with **DEV** quantization. |
| |
|
| | ## π― This Repository |
| |
|
| | **One variant, one download** - only get exactly what you need! |
| |
|
| | - **Model**: FLUX1 |
| | - **Architecture**: Fp4 (Compute Capability 8.0+) |
| | - **Quantization**: DEV |
| | - **Memory**: TBD |
| | - **Speed**: TBD for 1024x1024 generation |
| |
|
| | ## π Quick Start |
| |
|
| | ### Automatic (Recommended) |
| |
|
| | ```bash |
| | # ImageAI server downloads automatically |
| | curl -X POST "http://localhost:8001/generate" \ |
| | -H "Content-Type: application/json" \ |
| | -d '{ |
| | "prompt": "a beautiful landscape", |
| | "model": "flux1-tensorrt_rtx:dev", |
| | "width": 1024, |
| | "height": 1024 |
| | }' |
| | ``` |
| |
|
| | ### Manual Download |
| |
|
| | ```python |
| | from huggingface_hub import snapshot_download |
| | |
| | # Download this specific variant only |
| | engines_path = snapshot_download( |
| | repo_id="imgailab/flux1-trtx-dev-fp4-blackwell" |
| | ) |
| | |
| | # Engines are in: engines_path/engines/*.plan |
| | ``` |
| |
|
| | ### Direct Integration |
| |
|
| | ```python |
| | from imageai_server.tensorrt.nvidia_sdxl_pipeline import NVIDIASDXLPipeline |
| | |
| | pipeline = NVIDIASDXLPipeline() |
| | pipeline.load_engines( |
| | engine_dir=f"{engines_path}/engines", |
| | framework_model_dir=f"{engines_path}/framework", |
| | onnx_dir=f"{engines_path}/onnx" |
| | ) |
| | pipeline.activate_engines() |
| | |
| | images, time_ms = pipeline.infer( |
| | prompt="a serene mountain landscape", |
| | height=1024, |
| | width=1024 |
| | ) |
| | ``` |
| |
|
| | ## π Performance |
| |
|
| | | Metric | Value | |
| | |--------|-------| |
| | | **Memory Usage** | TBD | |
| | | **Inference Speed** | TBD | |
| | | **Resolution** | 1024x1024 (optimized) | |
| | | **Batch Size** | 1 (optimized) | |
| | | **Precision** | DEV | |
| |
|
| | ## π§ Requirements |
| |
|
| | ### Hardware |
| | - **GPU**: Fp4 architecture |
| | - Ampere: RTX 3090, A100, etc. |
| | - Ada Lovelace: RTX 4090, etc. |
| | - Blackwell: H200, etc. |
| | - **VRAM**: TBD minimum |
| | - **Compute Capability**: 8.0+ |
| |
|
| | ### Software |
| | - **TensorRT-RTX**: 1.0.0.21+ |
| | - **CUDA**: 12.0+ |
| | - **Python**: 3.8+ |
| |
|
| | ## π Repository Structure |
| |
|
| | ``` |
| | flux1-trtx-dev-fp4-blackwell/ |
| | βββ engines/ # TensorRT engine files |
| | β βββ *.plan # Optimized engines |
| | βββ config.json # Configuration metadata |
| | βββ README.md # This file |
| | ``` |
| |
|
| | ## π Related Repositories |
| |
|
| | Other variants for FLUX1: |
| | - [Ampere BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-ampere)\n- [Ada FP8](https://huggingface.co/imgailab/flux1-trtx-fp8-ada)\n- [Ada BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-ada)\n- [Blackwell FP4](https://huggingface.co/imgailab/flux1-trtx-fp4-blackwell)\n- [Blackwell FP8](https://huggingface.co/imgailab/flux1-trtx-fp8-blackwell)\n- [Blackwell BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-blackwell)\n |
| |
|
| | ## π License |
| |
|
| | Inherits license from base model: [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev) |
| |
|
| | ## π Updates |
| |
|
| | - **2025-08-12**: Initial release |
| | - Optimized for single-variant downloads |
| |
|
| | --- |
| |
|
| | *Part of the ImageAI TensorRT-RTX engine collection* |
| |
|