🖼️ image

Long-LRM

An efficient 3D Gaussian reconstruction model for rapid reconstruction of large scenes

#machine learning
#image processing
#deep learning
#3D reconstruction
#Gaussian model

Product Details

Long-LRM is a 3D Gaussian reconstruction model capable of reconstructing large scenes from a sequence of input images. It can process 32 source images at 960x540 resolution in 1.3 seconds on a single A100 80G GPU. The model combines recent Mamba2 blocks with conventional transformer blocks, and preserves quality while improving efficiency through token merging and Gaussian pruning steps. Unlike traditional feed-forward models, Long-LRM reconstructs the entire scene in a single pass instead of only a small portion of it. On large-scale scene datasets such as DL3DV-140 and Tanks and Temples, Long-LRM achieves performance comparable to optimization-based methods while improving efficiency by two orders of magnitude.
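The page does not reproduce the exact layer layout, so the PyTorch sketch below only illustrates the kind of hybrid stack described above: a sequence made mostly of Mamba2-style blocks plus a few standard transformer blocks, with a token-merging stage in between. The `Mamba2Block` placeholder, the block counts, and the merge ratio are assumptions for illustration, not the released architecture.

```python
import torch
import torch.nn as nn

class Mamba2Block(nn.Module):
    """Stand-in for a Mamba2 state-space block (the real mixer is a selective SSM, not a Linear)."""
    def __init__(self, dim):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.mixer = nn.Linear(dim, dim)

    def forward(self, x):
        return x + self.mixer(self.norm(x))

class TransformerBlock(nn.Module):
    """Standard pre-norm self-attention block."""
    def __init__(self, dim, heads=8):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):
        h = self.norm(x)
        return x + self.attn(h, h, h, need_weights=False)[0]

class HybridStack(nn.Module):
    """Mostly-Mamba2 stack with a few transformer blocks and a token-merging stage in between."""
    def __init__(self, dim=256, n_mamba=6, n_attn=2, merge_ratio=2):
        super().__init__()
        self.early = nn.ModuleList([Mamba2Block(dim) for _ in range(n_mamba)])
        self.late = nn.ModuleList([TransformerBlock(dim) for _ in range(n_attn)])
        self.merge_ratio = merge_ratio

    def forward(self, tokens):  # tokens: (B, N, dim), patch tokens from all input views
        for blk in self.early:
            tokens = blk(tokens)
        # Token merging: average each group of adjacent tokens to shorten the sequence
        # before the (quadratic-cost) attention blocks.
        b, n, d = tokens.shape
        n = n - n % self.merge_ratio
        tokens = tokens[:, :n].reshape(b, n // self.merge_ratio, self.merge_ratio, d).mean(dim=2)
        for blk in self.late:
            tokens = blk(tokens)
        return tokens

# Example: a long token sequence standing in for 32 views' worth of image patches.
x = torch.randn(1, 4096, 256)
print(HybridStack()(x).shape)  # torch.Size([1, 2048, 256])
```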

Main Features

1. Processes up to 32 high-resolution input images for fast 3D scene reconstruction
2. Uses a hybrid architecture of Mamba2 blocks and transformer blocks to improve token processing capacity
3. Balances reconstruction quality and efficiency through token merging and Gaussian pruning steps (a pruning sketch follows this list)
4. Reconstructs the entire scene in a single feed-forward step, without iterative optimization
5. Delivers performance comparable to optimization-based methods on large-scale scene datasets
6. Improves efficiency by two orders of magnitude, significantly reducing compute consumption
7. Supports wide view coverage and high-quality photorealistic reconstruction
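As a concrete illustration of the Gaussian pruning idea in item 3, the sketch below simply discards Gaussians whose opacity falls below a threshold, keeping only the most visible ones. The parameter layout and the opacity criterion are illustrative assumptions, not Long-LRM's actual pruning rule.

```python
import torch

def prune_gaussians(means, opacities, scales, colors, min_opacity=0.01):
    """Keep only the Gaussians whose opacity exceeds a threshold (illustrative criterion)."""
    keep = opacities.squeeze(-1) > min_opacity      # (N,) boolean mask
    return means[keep], opacities[keep], scales[keep], colors[keep]

# Toy example: 100k predicted Gaussians, most of them nearly transparent.
n = 100_000
means = torch.randn(n, 3)           # 3D centers
opacities = torch.rand(n, 1) ** 4   # skewed toward low opacity
scales = torch.rand(n, 3)
colors = torch.rand(n, 3)

pruned_means, *_ = prune_gaussians(means, opacities, scales, colors)
print(f"kept {pruned_means.shape[0]} of {n} Gaussians")
```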

How to Use

1. Prepare a series of input images of the scene to be reconstructed, at a resolution of at least 960x540.
2. Make sure you have compatible GPU hardware, such as an A100 80G GPU.
3. Load the input images and the Long-LRM model into your computing environment.
4. Configure the model parameters, including the token merging strategy and the Gaussian pruning threshold.
5. Run the Long-LRM model and wait for it to process the input images and generate the 3D reconstruction (a hypothetical workflow sketch follows these steps).
6. Inspect and evaluate the reconstructed 3D scene, applying post-processing and optimization as needed.
7. Apply the reconstructed 3D scene in your target domain, such as 3D printing, virtual reality, or game development.
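Because this page does not quote the official API, the following snippet is only a hypothetical end-to-end run that mirrors the steps above. The `long_lrm` import, the `LongLRM` class, the checkpoint name, and the configuration keys are invented for illustration and are left commented out; only the image-loading portion uses real, verifiable APIs.

```python
from pathlib import Path

import torch
from PIL import Image
from torchvision.transforms.functional import to_tensor

# Step 1: collect up to 32 source images at 960x540 (directory name is a placeholder).
image_dir = Path("scene_images")
frames = [to_tensor(Image.open(p).convert("RGB").resize((960, 540)))
          for p in sorted(image_dir.glob("*.jpg"))[:32]]
images = torch.stack(frames).unsqueeze(0)          # (1, V, 3, 540, 960)
if torch.cuda.is_available():                      # Step 2: an A100-class GPU is recommended
    images = images.cuda()

# Steps 3-4: hypothetical model class and configuration keys (not the released code).
# from long_lrm import LongLRM
# model = LongLRM.from_pretrained("long-lrm.ckpt",
#                                 token_merge_ratio=2,
#                                 gaussian_prune_opacity=0.01).to(images.device).eval()

# Step 5: a single feed-forward pass is expected to return the per-scene Gaussians.
# with torch.no_grad():
#     gaussians = model(images)

# Steps 6-7: export for inspection or downstream use (file format is an assumption).
# gaussians.save_ply("scene.ply")

print(f"prepared {images.shape[1]} views of shape {tuple(images.shape[-2:])}")
```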

Target Users

The target audience is 3D modelers, game developers, virtual reality content creators, and any professionals who need fast and efficient 3D scene reconstruction. Long-LRM's high efficiency and high-quality reconstruction capabilities enable these users to create realistic 3D scenes in a short time, accelerate the product development process, and improve work efficiency.

Examples

Use Long-LRM to quickly reconstruct a 3D city model from a series of urban street-view photos.

In game development, use Long-LRM to reconstruct game scenes from real-world photos and improve scene realism.

Virtual reality content creators use Long-LRM to reconstruct high-precision virtual environments from photos taken from multiple angles.

Quick Access

Visit Website →

Categories

🖼️ image
› research tools
› 3D modeling

Related Recommendations

Discover more high-quality AI tools like this one

Google CameraTrapAI

Google CameraTrapAI is a collection of AI models for wildlife image classification. It identifies animal species from images captured by motion-triggered wildlife cameras (camera traps). This technology is valuable for wildlife monitoring and conservation, helping researchers and conservation workers process large volumes of image data more efficiently and save time. The model is built on deep learning technology and offers high accuracy and strong classification capability.

AI image recognition
🖼️ image
SRM

SRM is a spatial reasoning framework based on denoising generative models for processing reasoning tasks on sets of continuous variables. It progressively infers a continuous representation of each unobserved variable by assigning independent noise levels to these variables. This technology performs well when dealing with complex distributions and can effectively reduce hallucinations during the generation process. SRM demonstrated for the first time that a denoising network can predict the generation order, significantly improving the accuracy of a specific inference task. The model was developed by the Max Planck Institute for Informatics in Germany and aims to promote research on spatial reasoning and generative models.

generative model spatial reasoning
🖼️ image
Magma-8B

Magma-8B is a multimodal AI foundation model developed by Microsoft, designed specifically for research on multimodal AI agents. It combines text and image inputs, generates text output, and has visual planning and agent capabilities. The model uses Meta LLaMA-3 as its language backbone, combined with the CLIP-ConvNeXt-XXLarge visual encoder, supports learning spatiotemporal relationships from unlabeled video data, and shows strong generalization and multi-task adaptability. Magma-8B performs well on multimodal tasks, especially in spatial understanding and reasoning. It provides a powerful tool for multimodal AI research and advances the study of complex interactions in virtual and real environments.

AI text generation
🖼️ image
ZeroBench

ZeroBench is a benchmark designed to evaluate the visual understanding capabilities of large multimodal models (LMMs). It challenges the limits of current models with 100 carefully crafted and rigorously vetted complex questions, comprising 334 sub-questions. The benchmark aims to fill gaps in existing visual benchmarks and provide a more challenging, higher-quality evaluation tool. ZeroBench's main advantages are its difficulty, light weight, diversity, and high quality, which allow it to effectively differentiate model performance. It also provides detailed sub-question evaluations to help researchers better understand a model's reasoning capabilities.

Artificial Intelligence multimodal
🖼️ image
MILS

MILS is an open source project released by Facebook Research that aims to demonstrate the ability of large language models (LLMs) to handle visual and auditory tasks without any training. This technology enables automatic description generation of images, audio and video by utilizing pre-trained models and optimization algorithms. This technological breakthrough provides new ideas for the development of multi-modal artificial intelligence and demonstrates the potential of LLMs in cross-modal tasks. This model is primarily intended for researchers and developers, providing them with a powerful tool to explore multimodal applications. The project is currently free and open source and aims to promote academic research and technology development.

Artificial Intelligence multimodal
🖼️ image
InternVL2_5-26B-MPO

InternVL2_5-26B-MPO is a multimodal large language model (MLLM). Based on InternVL2.5, it further improves the model performance through Mixed Preference Optimization (MPO). This model can process multi-modal data including images and text, and is widely used in scenarios such as image description and visual question answering. Its importance lies in its ability to understand and generate text that is closely related to the content of the image, pushing the boundaries of multi-modal artificial intelligence. Product background information includes its superior performance in multi-modal tasks and evaluation results in OpenCompass Leaderboard. This model provides researchers and developers with powerful tools to explore and realize the potential of multimodal artificial intelligence.

multimodal Large language model
🖼️ image
DeepSeek-VL2-Tiny

DeepSeek-VL2 is a series of advanced large-scale Mixture-of-Experts (MoE) visual language models that are significantly improved over the previous generation DeepSeek-VL. The series demonstrates excellent capabilities across multiple tasks such as visual question answering, optical character recognition, document/table/chart understanding, and visual grounding. DeepSeek-VL2 consists of three variants: DeepSeek-VL2-Tiny, DeepSeek-VL2-Small and DeepSeek-VL2, with 1.0B, 2.8B and 4.5B activation parameters respectively. DeepSeek-VL2 achieves competitive or state-of-the-art performance compared to existing open-source dense and MoE-based models with similar or fewer activation parameters.

natural language processing image recognition
🖼️ image
DeepSeek-VL2

DeepSeek-VL2 is a series of large-scale Mixture-of-Experts visual language models that are significantly improved compared to the previous generation DeepSeek-VL. This model series demonstrates excellent capabilities in tasks such as visual question answering, optical character recognition, document/table/diagram understanding, and visual localization. DeepSeek-VL2 contains three variants: DeepSeek-VL2-Tiny, DeepSeek-VL2-Small and DeepSeek-VL2, with 1.0B, 2.8B and 4.5B activation parameters respectively. DeepSeek-VL2 achieves competitive or state-of-the-art performance compared to existing open source dense and MoE base models with similar or fewer activation parameters.

visual language model Visual Q&A
🖼️ image
Aquila-VL-2B-llava-qwen

The Aquila-VL-2B model is a visual language model (VLM) trained on the LLava-one-vision framework, with Qwen2.5-1.5B-instruct selected as the language model (LLM) and siglip-so400m-patch14-384 as the vision tower. The model is trained on the self-built Infinity-MM dataset, which contains approximately 40 million image-text pairs combining open-source data collected from the Internet and synthetic instruction data generated with an open-source VLM. Aquila-VL-2B is open-sourced to promote progress in multimodal performance, especially in the combined processing of images and text.

machine learning text generation
🖼️ image
Sparsh

Sparsh is a family of general-purpose tactile representations trained with self-supervised algorithms such as MAE, DINO and JEPA. It generates useful representations for DIGIT, Gelsight'17 and Gelsight Mini sensors, and significantly surpasses end-to-end models on the downstream tasks proposed in TacBench. It also supports data-efficient training for new downstream tasks. The Sparsh project includes PyTorch implementations, pretrained models, and the datasets released with Sparsh.

machine learning Dataset
🖼️ image
DreamClear

DreamClear is a deep learning model focused on high-capacity real-world image restoration, providing an efficient image super-resolution and restoration solution through privacy-safe dataset curation. The model was presented at NeurIPS 2024. Its main advantages include high-capacity processing, privacy protection, and efficiency in practical applications. DreamClear builds on previous work and provides a variety of pre-trained models and code to facilitate use by researchers and developers. The product is free and targets the image-processing needs of research and industry.

deep learning Privacy protection
🖼️ image
DocLayout-YOLO

DocLayout-YOLO is a deep learning model for document layout analysis that enhances the accuracy and processing speed of document layout analysis through diverse synthetic data and global-to-local adaptive perception. This model generates a large-scale and diverse DocSynth-300K data set through the Mesh-candidate BestFit algorithm, which significantly improves the fine-tuning performance of different document types. In addition, it also proposes a global-to-local controllable receptive field module to better handle multi-scale changes in document elements. DocLayout-YOLO performs well on downstream datasets on a variety of document types, with significant advantages in both speed and accuracy.

deep learning image recognition
🖼️ image
VisRAG

VisRAG is an innovative visual language model (VLM)-based RAG (Retrieval-Augmented Generation) process. Unlike traditional text-based RAG, VisRAG directly embeds documents as images through VLM and then retrieves them to enhance the generation capabilities of VLM. This method retains the data information in the original document to the greatest extent and eliminates the information loss introduced during the parsing process. The application of the VisRAG model on multi-modal documents demonstrates its strong potential in information retrieval and enhanced text generation.

visual language model retrieval-augmented generation
🖼️ image
Proofig AI

Proofig AI is an AI-based automated image proofreading tool specially built for the scientific publishing field and trusted by the world's top researchers, publishers and research institutions. The system provides advanced support for detecting image reuse and tampering, including duplication between images and within a single image. In addition, Proofig focuses on detecting variations such as cloning, rotation, flipping, scaling, etc. The product pricing is flexible and positioned in the field of scientific research, aiming to provide users with efficient and accurate image proofing services.

AI automation
🖼️ image
HyFluid

HyFluid is a neural method for inferring fluid density and velocity fields from sparse multi-view videos. Unlike existing neural dynamic-scene reconstruction methods, HyFluid accurately estimates density and recovers the underlying velocity, overcoming the inherent visual ambiguity of fluid velocity. The method infers a physically plausible velocity field by introducing a set of physics-based losses. To handle the turbulent nature of fluid velocity, it designs a hybrid neural velocity representation that combines a base neural velocity field, which captures most of the irrotational energy, with vortex particle velocities that model the remaining turbulent motion. The method can be used for a variety of learning and reconstruction applications around 3D incompressible flow, including fluid re-simulation and editing, future prediction, and neural dynamic scene synthesis.

video processing fluid dynamics
🖼️ image
MyLens

MyLens is an AI-powered timeline product that helps users gain insight into the intersections between historical events. Users can create, explore and connect stories to seamlessly explore connections between different histories.

AI story
🖼️ image