Home Community Meet TextDeformer: An AI Framework For Text-guided 3D Mesh Deformation

Meet TextDeformer: An AI Framework For Text-guided 3D Mesh Deformation

0
Meet TextDeformer: An AI Framework For Text-guided 3D Mesh Deformation

Three-dimensional (3D) meshes are a primary component of computer graphics and 3D modeling and have several fields of application, including architecture, automotive design, video game development, and film production. A mesh is a digital representation of a three-dimensional object comprising a group of vertices, edges, and faces that outline its shape and structure. The vertices represent the points in space where the perimeters meet, while the faces define the item’s surface. 

Since creating 3D meshes is difficult, it will likely be reserved for experts with special artistic skills. This suggests that an individual would find it difficult to create 3D meshes from zero without this data. The web makes it possible to seek out diverse datasets with 3D objects crafted by digital artists. Nonetheless, when customization (even minimal) is required, the editing process is as arduous as plain creation.

For that reason, the issue of deforming meshes is a subject that has received an important deal of attention in computer graphics and geometry processing. In lots of existing AI techniques, a user can manipulate deformations through control handles, allowing coarse, low-frequency deformations that preserve details. These are commonly known as detail-preserving deformations. Nonetheless, in 3D modeling, it is usually obligatory to include fantastic geometric information, which could be time-consuming and complex, even for expert artists.

🚀 JOIN the fastest ML Subreddit Community

On this sense, a novel AI approach, termed TextDeformer, has been proposed to automate the deformation strategy of 3D meshes. TextDeformer goals to rework a given source shape to a desired goal shape while maintaining semantic consistency between the 2. An outline of the system workflow and architecture is presented below.

This approach relies on the success of recent text-guided generative techniques and NeRFs (Neural Radiance Fields) but doesn’t require 3D training data. As an alternative, the authors use differentiable rendering with pre-trained image encoders like CLIP to regulate and optimize the geometry of the rendered objects.

After deformation, the structure and properties of the source mesh are preserved, and the resulting geometry adheres to the text specifications. This work differs from previous ones within the sort of task the model performs. Unlike previous text-guided works that generate geometry from scratch or add detail while preserving input mesh geometry, TextDeformer focuses on the deformation task.

Intimately, this framework is designed to change an existing input shape to create high-quality geometry that accurately reflects the source mesh. As well as, it could possibly produce low-frequency shape changes and high-frequency details, reminiscent of elongating a cow’s neck when deforming it right into a giraffe or adding scales when deforming into an alligator. The authors insist that the resulting correspondences from the source shape to the goal are continuous and semantically meaningful (e.g., “leg deforms to leg”) by coloring the source mesh, which is visible throughout the visualizations.

Some examples of the produced results reported by the authors of this work are illustrated within the figure below. Moreover, this figure features a comparison between TextDeformer and the state-of-the-art DreamFusion.

This was the summary of TextDeformer, a novel AI framework to enable accurate text-guided 3D mesh deformation. In case you have an interest, you possibly can learn more about this system within the links below.


Take a look at the Paper. Don’t forget to affix our 20k+ ML SubRedditDiscord Channel, and Email Newsletter, where we share the most recent AI research news, cool AI projects, and more. If you’ve any questions regarding the above article or if we missed anything, be at liberty to email us at Asif@marktechpost.com

🚀 Check Out 100’s AI Tools in AI Tools Club


Daniele Lorenzi received his M.Sc. in ICT for Web and Multimedia Engineering in 2021 from the University of Padua, Italy. He’s a Ph.D. candidate on the Institute of Information Technology (ITEC) on the Alpen-Adria-Universität (AAU) Klagenfurt. He’s currently working within the Christian Doppler Laboratory ATHENA and his research interests include adaptive video streaming, immersive media, machine learning, and QoS/QoE evaluation.


LEAVE A REPLY

Please enter your comment!
Please enter your name here