imgailab
/

flux1-trtx-dev-fp4-blackwell

Model card Files Files and versions

flux1-trtx-dev-fp4-blackwell / README.md

Mitchins's picture

Update README for flux1-dev-fp4-blackwell

391fa05 verified 7 months ago

|

history blame contribute delete

3.14 kB

	---
	library_name: tensorrt-rtx
	license: apache-2.0
	base_model: black-forest-labs/FLUX.1-dev
	tags:
	- tensorrt-rtx
	- flux1
	- fp4
	- dev
	- optimized
	inference: false
	---

	# FLUX1 TensorRT-RTX: DEV-Fp4 🔨 Building

	Optimized TensorRT-RTX engines for FLUX1 on Fp4 architecture with DEV quantization.

	## 🎯 This Repository

	One variant, one download - only get exactly what you need!

	- Model: FLUX1
	- Architecture: Fp4 (Compute Capability 8.0+)
	- Quantization: DEV
	- Memory: TBD
	- Speed: TBD for 1024x1024 generation

	## 🚀 Quick Start

	### Automatic (Recommended)

	```bash
	# ImageAI server downloads automatically
	curl -X POST "http://localhost:8001/generate" \
	-H "Content-Type: application/json" \
	-d '{
	"prompt": "a beautiful landscape",
	"model": "flux1-tensorrt_rtx:dev",
	"width": 1024,
	"height": 1024
	}'
	```

	### Manual Download

	```python
	from huggingface_hub import snapshot_download

	# Download this specific variant only
	engines_path = snapshot_download(
	repo_id="imgailab/flux1-trtx-dev-fp4-blackwell"
	)

	# Engines are in: engines_path/engines/*.plan
	```

	### Direct Integration

	```python
	from imageai_server.tensorrt.nvidia_sdxl_pipeline import NVIDIASDXLPipeline

	pipeline = NVIDIASDXLPipeline()
	pipeline.load_engines(
	engine_dir=f"{engines_path}/engines",
	framework_model_dir=f"{engines_path}/framework",
	onnx_dir=f"{engines_path}/onnx"
	)
	pipeline.activate_engines()

	images, time_ms = pipeline.infer(
	prompt="a serene mountain landscape",
	height=1024,
	width=1024
	)
	```

	## 📊 Performance

	\| Metric \| Value \|
	\|--------\|-------\|
	\| Memory Usage \| TBD \|
	\| Inference Speed \| TBD \|
	\| Resolution \| 1024x1024 (optimized) \|
	\| Batch Size \| 1 (optimized) \|
	\| Precision \| DEV \|

	## 🔧 Requirements

	### Hardware
	- GPU: Fp4 architecture
	- Ampere: RTX 3090, A100, etc.
	- Ada Lovelace: RTX 4090, etc.
	- Blackwell: H200, etc.
	- VRAM: TBD minimum
	- Compute Capability: 8.0+

	### Software
	- TensorRT-RTX: 1.0.0.21+
	- CUDA: 12.0+
	- Python: 3.8+

	## 📁 Repository Structure

	```
	flux1-trtx-dev-fp4-blackwell/
	├── engines/ # TensorRT engine files
	│ ├── *.plan # Optimized engines
	├── config.json # Configuration metadata
	└── README.md # This file
	```

	## 🌐 Related Repositories

	Other variants for FLUX1:
	- [Ampere BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-ampere)\n- [Ada FP8](https://huggingface.co/imgailab/flux1-trtx-fp8-ada)\n- [Ada BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-ada)\n- [Blackwell FP4](https://huggingface.co/imgailab/flux1-trtx-fp4-blackwell)\n- [Blackwell FP8](https://huggingface.co/imgailab/flux1-trtx-fp8-blackwell)\n- [Blackwell BF16](https://huggingface.co/imgailab/flux1-trtx-bf16-blackwell)\n

	## 📝 License

	Inherits license from base model: [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev)

	## 🔄 Updates

	- 2025-08-12: Initial release
	- Optimized for single-variant downloads

	---

	Part of the ImageAI TensorRT-RTX engine collection