Back to Top

TRELLIS 2: Microsoft soars to Image-to-3D AI Generation

Updated 27 January 2026

In the field of AI-generated 3D content, which continues to rapidly develop, Microsoft has introduced the open-source model, TRELLIS 2.

A 4B parameter Artifical Intelligence model which transforms a single image into detailed 3d objects.

It employs an innovative process that creates realistic and highly detailed 3D models in a matter of seconds and fully textured.

TRELLIS 2 will make creation of 3D models easier to game developers, digital artists, and researchers.

It is quicker and more manageable and is able to process shapes that the outdated practices struggle to handle.

Start your headless eCommerce
now.
Find out More

What Makes TRELLIS 2 Stand Out?

In simple terms, TRELLIS 2 transforms a single 2D image into an elaborate 3D image .

The system adds effects of color, glossiness, metallic surface, and transparency.

The TRELLIS 2 O-Voxel system replaces older field-based systems, such as signed distance fields (SDFs).

This method avoids the problems that the older methods experienced with open surfaces and complicated interior geometry.

It enables TRELLIS 2 to enjoy processing all kinds of geometry, whether they are thin, hollow, or organic, without error which is vital in providing real appearance.

Key features include:

1) High Quality, Resolution & Efficiency

TRELLIS 2 a 4B-parameter model generates high-resolution fully textured assets with exceptional fidelity and efficiency with vanilla DiTs.

2) Arbitrary Topology Handling:

This method robustly handles complex structures, including open surfaces, non-manifold geometry, and enclosed interior structures, breaking the constraints of iso-surface fields.

3) Minimalist 3D Asset Pre- and Post-processing

Data processing for training and inference are simple, enabling instant conversions that are fully rendering-free and optimization-free

4) Rich Texture Modeling

These method can model arbitrary surface attributes such as Base Color, Roughness, Metallic, and Opacity , enabling Physically Based Rendering (PBR) and photorealistic relighting.

A Technical Deep Dive

Architecture of Trellis 2

Image Source : TRELLIS2@https://microsoft.github.io

The design of Trellis 2 is the use of O-Voxel shapes and a flow-matching transformer.

O-Voxel represents the geometry and appearance of a model in a small, efficient format, handling sharp edges and complicated shapes.

The O-Voxel data is then fed into SC-VAE which further compressed data into highly structured data that is easy to use in the development of large 3D models.

Its pipeline is simple: a textured 3D model can be transformed into O -Voxel in less than ten seconds on a CPU .

Encoded with the help of the SC -VAE and decoded with the help of Diffusion Transformers (DiTs).

It uses shape-based texturing in the creation of models, which enables users to create PBR maps to any geometry.

TRELLIS 2 was trained using the Objaverse-xl and Sketchfab dataset. It learned shapes and textures separately and then learned sparse structures through flow modelling.

Real-life Benchmarks & Performance of Trellis 2

The users claim that TRELLIS 2 is the highest-ranking open-source image-to-3-D converting model and outperforms many proprietary ones

However the higher-quality models like Hunyuan 3.0 sometimes do achieve higher performance.

Trellis 2 has the capability to generate GLB files directly, thus making it easy to integrate into other tools such as Blender and Unity.

Reddit and YouTube critics applaud that it can run on high-end GPUs and interfaces like ComfyUI make its use easier.

Nevertheless, the generated models may have small holes.

That have to be fixed before 3-D printing, and the model does not improve the style of the training data.

TRELLIS 2 is based on Linux and has a minimum of 24GB of VRAM.

The Future of 3D Generation

TRELLIS 2 is a major advance in high-speed, system-on-chip 3-D AI. It is capable of reshaping several industries in the form of time-saving, reduced skill requirement in producing 3-D assets

It has a MIT license, which permits anyone to contribute. The roadmap includes the plans to speed up the performance and expand platform support.

It is not perfect but is open-source, which speeds up the process of improvement. Observe its development to understand what the further moves in the sphere of AI-based 3-D creation are going to be.

To get latest Advancements in AI visit webkul !!

. . .

Leave a Comment

Your email address will not be published. Required fields are marked*


Be the first to comment.

Back to Top

Message Sent!

If you have more details or questions, you can reply to the received confirmation email.

Back to Home