AI tools for 3D

Find and compare the top AI tools for 3D. Browse features, pricing, and user ratings of all the AI tools and apps in the market.

Interior AI

Redesign your interior in seconds using AI with Interior AI. Say goodbye to expensive interior designers and hello to a cost-effective and convenient way to transform your living space. Simply take a photo of your current interior and choose from a range of interior styles, from Modern to Minimalist to Contemporary. Interior AI will use its advanced AI technology to generate photorealistic renders of your redesigned space. You can even use virtual home staging to furnish empty homes for real estate listings. With Interior AI, you can turn your design ideas into reality with just a few clicks.

  • Redesign your interior in seconds using AI
  • Choose from a range of interior styles, including Modern, Minimalist, and Contemporary
  • Transform sketches and SketchUp files into photorealistic renders
  • Virtual home staging for real estate listings
  • Create 3D flythrough videos of your interior designs

52

AiOS (All-in-One-Stage)

AiOS is a novel approach to 3D whole-body human mesh recovery that aims to address limitations of existing two-stage methods. Developed by researchers from institutions including SenseTime Research, City University of Hong Kong, and Nanyang Technological University, AiOS performs human pose and shape estimation in a single stage, without requiring a separate human detection step.

The key innovation of AiOS is its all-in-one-stage design that processes the full image frame end-to-end. This is in contrast to previous top-down approaches that first detect and crop individual humans before estimating pose and shape. By operating on the full image, AiOS preserves important contextual information and inter-person relationships that can be lost when cropping. 

AiOS is built on the DETR (DEtection TRansformer) architecture and frames multi-person whole-body mesh recovery as a progressive set prediction problem. It uses a series of transformer decoder stages to localize humans and estimate their pose and shape parameters in a coarse-to-fine manner.

The first stage uses "human tokens" to identify coarse human locations and encode global features for each person. Subsequent stages refine these initial estimates, using "joint tokens" to extract more fine-grained local features around body parts. This progressive refinement allows AiOS to handle challenging cases like occlusions.

By estimating pose and shape for the full body, hands, and face in a unified framework, AiOS is able to capture expressive whole-body poses. It outputs parameters for the SMPL-X parametric human body model, providing a detailed 3D mesh representation of each person.

The researchers evaluated AiOS on several benchmark datasets for 3D human pose and shape estimation. Compared to previous state-of-the-art methods, AiOS achieved significant improvements, including a 9% reduction in normalized mesh vertex error (NMVE) on the AGORA dataset and a 30% reduction in per-vertex error (PVE) on EHF.

Key features of AiOS include:

  • Single-stage, end-to-end architecture for multi-person pose and shape estimation
  • Operates on full image frames without requiring separate human detection
  • Progressive refinement using transformer decoder stages
  • Unified estimation of body, hand, and face pose/shape
  • Outputs SMPL-X body model parameters
  • State-of-the-art performance on multiple 3D human pose datasets
  • Effective for challenging scenarios like occlusions and crowded scenes
  • Built on DETR transformer architecture

3

DIAMOND Diffusion for World Modeling

DIAMOND is an innovative reinforcement learning agent that is trained entirely within a diffusion world model. Developed by researchers from the University of Geneva, University of Edinburgh, and Microsoft Research, DIAMOND represents a significant advancement in world modeling for reinforcement learning.

The key innovation of DIAMOND is its use of a diffusion model to generate the world model, rather than relying on discrete latent variables like many previous approaches. This allows DIAMOND to capture more detailed visual information that can be crucial for reinforcement learning tasks. The diffusion world model takes in the agent's actions and previous frames to predict and generate the next frame of the environment.
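The frame-by-frame prediction described above can be pictured as an autoregressive loop: generate a frame, append it to the history, and repeat. The following is a toy sketch only, not DIAMOND's code. `toy_denoiser`, the context length, and the number of denoising steps are hypothetical stand-ins, and frames are short lists of floats rather than images.

```python
import random
from collections import deque

CONTEXT_FRAMES = 4    # assumed number of past frames the model conditions on
DENOISING_STEPS = 3   # assumed number of refinement steps per generated frame
FRAME_SIZE = 8        # toy "image": a flat list of 8 pixel values


def toy_denoiser(noisy, past_frames, past_actions):
    """Placeholder for a learned network: moves the noisy frame halfway
    toward a target built from the latest frame and action."""
    last_frame, last_action = past_frames[-1], past_actions[-1]
    target = [pixel + 0.1 * last_action for pixel in last_frame]
    return [n + 0.5 * (t - n) for n, t in zip(noisy, target)]


def generate_next_frame(past_frames, past_actions, rng):
    """Start from Gaussian noise and refine it, conditioned on the history."""
    frame = [rng.gauss(0.0, 1.0) for _ in range(FRAME_SIZE)]
    for _ in range(DENOISING_STEPS):
        frame = toy_denoiser(frame, past_frames, past_actions)
    return frame


def rollout(initial_frames, policy, horizon, seed=0):
    """Imagine `horizon` frames by feeding generated frames back as context."""
    rng = random.Random(seed)
    frames = deque(initial_frames, maxlen=CONTEXT_FRAMES)
    actions = deque([0] * len(initial_frames), maxlen=CONTEXT_FRAMES)
    trajectory = []
    for _ in range(horizon):
        action = policy(list(frames))          # the agent acts on imagined frames
        actions.append(action)
        next_frame = generate_next_frame(list(frames), list(actions), rng)
        frames.append(next_frame)              # oldest frame drops out of context
        trajectory.append((action, next_frame))
    return trajectory


start = [[0.0] * FRAME_SIZE for _ in range(CONTEXT_FRAMES)]
imagined = rollout(start, policy=lambda history: 1, horizon=5)
```

Because every generated frame becomes input for the next one, errors can accumulate over a long rollout, which is one reason the stability of the sampler matters for this kind of world model.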

DIAMOND was initially developed and tested on Atari games, where it achieved state-of-the-art performance. On the Atari 100k benchmark, which limits agents to 100,000 environment interactions (roughly two hours of gameplay), DIAMOND achieved a mean human-normalized score of 1.46, a new record for agents trained entirely in a world model. A mean of 1.46 means that, averaged across games, its normalized score was 46% above the human reference.
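For readers unfamiliar with the metric, a human-normalized score rescales a raw game score so that 0 corresponds to a random policy and 1 to the human reference. The sketch below shows the arithmetic; the raw scores in it are made-up illustrative numbers, not results reported for DIAMOND.

```python
def human_normalized_score(agent_score, random_score, human_score):
    """Rescale a raw score so random play maps to 0 and human play to 1."""
    if human_score == random_score:
        raise ValueError("human and random scores must differ")
    return (agent_score - random_score) / (human_score - random_score)


# Hypothetical raw scores for three games: (agent, random, human).
illustrative_scores = {
    "game_a": (120.0, 10.0, 60.0),
    "game_b": (35.0, 5.0, 55.0),
    "game_c": (900.0, 100.0, 500.0),
}

per_game = {
    name: human_normalized_score(*scores)
    for name, scores in illustrative_scores.items()
}
mean_hns = sum(per_game.values()) / len(per_game)

print(f"mean human-normalized score: {mean_hns:.2f}")
```

In this made-up example the agent is below the human reference on `game_b` yet the mean is still above 1, which shows that a mean above 1 does not imply superhuman play in every game.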

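The researchers also applied DIAMOND to Counter-Strike: Global Offensive (CS:GO), training a world model on recorded human gameplay. The resulting CS:GO world model can be played interactively at about 10 frames per second on an RTX 3090 GPU. While it has some limitations and failure modes, it demonstrates the potential for diffusion models to capture complex 3D environments.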

Key features of DIAMOND include:

  • Diffusion-based world model that captures detailed visual information
  • State-of-the-art performance on Atari 100k benchmark
  • Ability to model both 2D and 3D game environments
  • End-to-end training of the reinforcement learning agent within the world model
  • Use of EDM sampling for stable trajectories with few denoising steps
  • Two-stage pipeline for modeling complex 3D environments
  • Interactive playability of generated world models
  • Open-source code and pre-trained models released for further research

1

CogVideo & CogVideoX

CogVideo and CogVideoX are advanced text-to-video generation models developed by researchers at Tsinghua University. These models represent significant advancements in the field of AI-powered video creation, allowing users to generate high-quality video content from text prompts.

CogVideo, the original model, is a large-scale pretrained transformer with 9.4 billion parameters. It was trained on 5.4 million text-video pairs, inheriting knowledge from the CogView2 text-to-image model. This inheritance significantly reduced training costs and helped address issues of data scarcity and weak relevance in text-video datasets. CogVideo introduced a multi-frame-rate training strategy to better align text and video clips, resulting in improved generation accuracy, particularly for complex semantic movements.

CogVideoX, an evolution of the original model, further refines the video generation capabilities. It uses a T5 text encoder to convert text prompts into embeddings, similar to other advanced AI models like Stable Diffusion 3 and Flux AI. CogVideoX also employs a 3D causal VAE (Variational Autoencoder) to compress videos into latent space, generalizing the concept used in image generation models to the video domain.
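The word "causal" in the 3D causal VAE refers to the time axis: the latent for a given frame depends only on that frame and earlier ones, never on later frames. The sketch below illustrates that property in one dimension with plain Python lists. The three-tap kernel and the choice to pad by repeating the first frame are assumptions made for illustration; the real model applies learned 3D convolutions to video tensors.

```python
def causal_temporal_conv(frames, kernel):
    """Convolve a per-frame scalar signal along time, padding only on the
    past side so that output[t] never depends on frames after t."""
    if not frames:
        return []
    pad = len(kernel) - 1
    # Assumed padding scheme: repeat the first frame as artificial history.
    padded = [frames[0]] * pad + list(frames)
    return [
        sum(weight * padded[t + i] for i, weight in enumerate(kernel))
        for t in range(len(frames))
    ]


kernel = [0.25, 0.25, 0.5]  # hypothetical weights; the last tap is the current frame
original = causal_temporal_conv([1.0, 2.0, 3.0, 4.0], kernel)
edited = causal_temporal_conv([1.0, 2.0, 3.0, 99.0], kernel)

# Changing the last frame leaves every earlier output untouched.
assert original[:3] == edited[:3]
```

A symmetric (non-causal) convolution would pad both sides, so editing the last frame would also change the output for the frame before it.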

Both models are capable of generating high-resolution videos (480x480 pixels) with impressive visual quality and coherence. They can create a wide range of content, from simple animations to complex scenes with moving objects and characters. The models are particularly adept at generating videos with surreal or dreamlike qualities, interpreting text prompts in creative and unexpected ways.

One of the key strengths of these models is their ability to generate videos locally on a user's PC, offering an alternative to cloud-based services. This local generation capability provides users with more control over the process and potentially faster turnaround times, depending on their hardware.

Key features of CogVideo and CogVideoX include:

  • Text-to-video generation: Create video content directly from text prompts.
  • High-resolution output: Generate videos at 480x480 pixel resolution.
  • Multi-frame-rate training: Improved alignment between text and video for more accurate representations.
  • Flexible frame rate control: Ability to adjust the intensity of changes throughout continuous frames.
  • Dual-channel attention: Efficient finetuning of pretrained text-to-image models for video generation.
  • Local generation capability: Run the model on local hardware for faster processing and increased privacy.
  • Open-source availability: The code and model are publicly available for research and development.
  • Large-scale pretraining: Trained on millions of text-video pairs for diverse and high-quality outputs.
  • Inheritance from text-to-image models: Leverages knowledge from advanced image generation models.
  • State-of-the-art performance: Outperforms many publicly available models in human evaluations.

603

Kling AI

Kling AI is an AI video platform that uses a 3D spatiotemporal joint attention mechanism to model complex motion and generate high-quality video content. It supports videos up to 2 minutes long at 30fps, simulates real-world physical characteristics, and produces cinema-grade video at 1080p resolution. Together, these capabilities let users create polished videos with little effort.

Currently, Kling AI is available for beta testing exclusively on the 'Kuaiying' app, with a web version to be released soon. To use Kling AI, users can join the beta by downloading the 'Kuaiying' app and signing up for access. The platform is capable of generating a wide range of video content, including those with significant motion, up to 2 minutes in length, and in various aspect ratios.

Kling AI's advanced technology allows it to simulate realistic physical characteristics and combine complex concepts to create unique and imaginative scenarios. It is also capable of generating cinema-grade videos with 1080p resolution, delivering stunning visuals from expansive scenes to detailed close-ups. With its flexible output video aspect ratios, Kling AI can meet the diverse needs of different video content scenarios.

Key features of Kling AI include:

  • Advanced 3D spatiotemporal joint attention mechanism
  • Generation of high-quality video content up to 2 minutes long with 30fps
  • Simulation of real-world physical characteristics
  • Cinema-grade video generation with 1080p resolution
  • Support for flexible video aspect ratios
  • Ability to combine complex concepts to create unique scenarios

1305

LivePortrait

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control. Developed by a team from Kuaishou Technology, this framework aims to synthesize lifelike videos from single source images. Using an appearance reference and motion data derived from various inputs such as driving videos, audio, text, or generation, LivePortrait balances computational efficiency with controllability.

The key innovation lies in its implicit-keypoint-based framework, which diverges from mainstream diffusion-based methods to enhance generalization, controllability, and efficiency for practical applications.

The framework is trained in two stages: base model training, followed by training of the stitching and retargeting modules. In the first stage, the appearance and motion extractors, warping module, and decoder are optimized from scratch. In the second stage, the stitching and retargeting modules are trained while the previously trained components are kept frozen. This structured approach allows LivePortrait to achieve high-quality video generation at high speed, reaching 12.8 ms on an RTX 4090 GPU. The project also uses a dataset of around 69 million high-quality frames and a mixed image-video training strategy to further improve generation quality and generalization.

Key features of LivePortrait include:

  • Implicit-Keypoint-Based Framework: Balances computational efficiency and controllability, moving away from mainstream diffusion-based methods.
  • High-Quality Data: Uses approximately 69 million high-quality frames for training.
  • Mixed Training Strategy: Incorporates both images and videos in the training process.
  • Stitching Module: Enhances the generation quality by integrating additional data.
  • Retargeting Modules: Controls specific facial features like eyes and lips for more precise animations.
  • Generalization Across Styles: Supports various portrait styles including realistic, oil painting, sculpture, and 3D rendering.
  • Animal Fine-Tuning: Capable of animating animal portraits by fine-tuning on animal datasets.
  • Performance: Achieves a generation speed of 12.8 ms per frame on an RTX 4090 GPU.
  • Open Source: The inference code and models are available on GitHub.

858

MiniAiLive - Face Liveness Detection

MiniAiLive offers face liveness detection through its 3D Face Anti-Spoofing SDK, which guards against deepfake and spoofing attacks so that only genuine users are authenticated. The SDK checks facial liveness and detects account cloning and impersonation attacks, returning results within seconds from a 5-second selfie, with a stated accuracy rate of 99%.

Use cases of MiniAiLive include:

  • Physical Access Control
  • Attendance System
  • Account Opening
  • User Onboarding
  • Fraud Prevention

9

AI Experts

AI Experts is an AI agency that helps businesses integrate artificial intelligence to drive growth and streamline operations. Its team of real humans bridges the gap between cutting-edge AI technology and a client's business goals, offering tools and insights to transform operations.

Use cases of AI Experts include:

  • Content creation: Produce high-quality content at scale with AI-driven tools for written articles, images, and videos.
  • Data analysis: Make informed decisions with actionable insights derived from complex datasets analyzed by AI algorithms.
  • Other use cases: 3D, advertising, AI agents, AI assistant, chatbot builder, copywriting, customer support, design, e-commerce, education, finance, gaming, health, marketing, productivity, SEO, social media, transcription, translation, video editing, and more.

11

DreamzAR AI Landscape Design Ideas

The AI Landscape Design Idea Generator is a powerful online tool that uses artificial intelligence to generate unique and creative landscape design ideas. The platform uses machine learning algorithms to analyze various design elements, such as colors, textures, and shapes, and combines them to create innovative and visually appealing designs.

Users can input their design preferences, such as style, color scheme, and plant types, and the AI algorithm will generate a customized landscape design concept. The platform also provides users with a 2D or 3D visual representation of their design, allowing them to visualize and refine their outdoor space.

Use cases of the AI Landscape Design Idea Generator include:

  • Generating unique and creative landscape design ideas
  • Creating customized designs based on user preferences
  • Visualizing and refining outdoor spaces with 2D or 3D representations
  • Exploring different design elements, such as colors, textures, and shapes
  • Getting inspiration for outdoor design projects

5

Eternity AC

Eternity.AC is an AI-powered platform that allows users to create a digital clone of themselves, including their thoughts, voice, and appearance, with a lifelike 3D avatar. The creation process involves recording thoughts by answering 25 or more questions in one of the supported languages, uploading selfies from different angles to create a 3D avatar, and saving the personal clone to the cloud. Basic clones are free to make, and unlimited tuning and talking are available with the $20 “Plus” plan.

Key features of Eternity.AC include:

  • AI-powered digital clone creation
  • Lifelike 3D avatar creation from user-uploaded selfies
  • Recording of thoughts, experiences, and memories
  • Support for six languages: English, Spanish, Portuguese, Polish, Ukrainian, and Belarusian
  • Free basic clones with an optional “Plus” plan for unlimited tuning and talking
  • GDPR compliance and secure handling of user data

2

Avatar One

Avatar.One allows you to create your own dream AI girlfriend in immersive 3D. Whether you want to chat with an AI girlfriend now or design your own unique companion, Avatar.One offers a customizable and personalized experience. With advanced avatar creation tools and lifelike interactions, you can explore a new level of companionship in the digital world.

Key features of Avatar.One include:

  • Create a unique character: Design your AI girlfriend with thousands of possible variations and even give her a live voice.
  • Fully animated in immersive 3D: Experience your AI girlfriend in full 3D with unique emotes and interactions.
  • Save memories: Add memories together with your chatbot to enrich your relationship and create lasting moments.
  • Unfiltered chat and roleplay: Engage in uncensored conversations, flirtation, and roleplay scenarios with your AI companion.
  • Safe and private interactions: Avatar.One prioritizes user privacy and data security, ensuring a safe experience for all users.

2

OverScene

OverScene is a visual AI tool that seamlessly integrates into your workflow, enhancing your art, design, coding, and image creation processes. Priced at $49.99 with an introductory offer of $29.99 for a lifetime license, OverScene for Windows eliminates the need for plugins, subscriptions, and extensive learning curves, providing a user-friendly experience without vendor lock-in.

Use cases of OverScene include transforming sketches into finished artwork in software such as Paint, Photoshop, and Illustrator; enhancing 3D models in Blender, ZBrush, and SketchUp; converting screenshots to code; and generating moodboards and prototypes in seconds. OverScene offers one-click installation, access to a wide range of AI models, and complete creative freedom without restrictions.

  • Slay your creative demons
  • Test ideas and generate moodboards
  • Transform sketches into masterpieces
  • Enhance 3D models with stunning detail
  • Convert screenshots to code effortlessly
  • Access a world of AI models
  • Enjoy complete creative freedom
  • Experience lightweight efficiency

13

Otherhalf

Introducing Otherhalf, the innovative interactive 3D AI companion that is set to revolutionize the way we engage with technology. Otherhalf combines cutting-edge artificial intelligence with a user-friendly interface to provide a unique and personalized experience for users. Whether you're looking for assistance with daily tasks, entertainment, or simply a friendly chat, Otherhalf is here to enhance your digital interactions.

Key features of Otherhalf include:

  • Advanced artificial intelligence technology for intelligent responses and interactions
  • Interactive 3D interface for a visually engaging experience
  • Personalized assistance tailored to individual user preferences
  • Entertainment options such as games, quizzes, and storytelling
  • Easy integration with various devices and platforms for seamless connectivity

12

Pietra Product Design Studio

Pietra Product Design Studio is a comprehensive platform that allows e-commerce brands to dream, design, and manufacture products with the help of AI technology. With over 250,000 brands and entrepreneurs already onboard, Pietra offers tools and resources to streamline the product design process, saving time and money for businesses.

Use cases of Pietra Product Design Studio include:

  • Create designs and variations in seconds using AI technology
  • Turn sketches into 3D designs for quick visualization
  • Collaborate with team members and suppliers for efficient communication
  • Access thousands of templates for design inspiration and faster ideation

15

Character Design By Museclip

Transform 3D base models into realistic characters in real time. Unleash creativity with drag-and-drop elements, a magic brush, and smart text-prompt editing. Your imagination, now instant.

27

Luminar Neo

Luminar Neo is an innovative image editing application powered by artificial intelligence. It’s designed to simplify complex editing routines and enables creators to bring their boldest ideas to life. The software is accessible to everyone thanks to an intuitive and user-friendly interface. It offers a wide range of instruments including layers, masking, and local adjustments. It’s available for both Windows and macOS, and can also be used as a plugin for Photoshop & Lightroom.

Key features of Luminar Neo include:

  • Sky AI: This feature allows you to seamlessly replace the sky in your photo and add realistic sky reflections.
  • Accent AI: An intelligent tool that substitutes more than a dozen controls including Shadows, Highlights, Contrast, Tone, Saturation, Exposure, and Details.
  • Atmosphere AI: This feature lets you place fog, mist, or haze in your image with content-aware masking for maximum realism.
  • Relight AI: This tool allows you to fix backlit photos by building a 3D map of an image so you can easily adjust lighting and exposure based on depth.
  • Portrait Bokeh AI: This feature simulates the effect of an out-of-focus background behind your subject without needing expensive lenses.
  • Skin AI: This tool automatically retouches your subject’s skin and removes imperfections.

As for the pricing:

  • 1 Month Plan: $11.95 per month, billed monthly (about $143 per year)
  • 12 Months Plan: $8.25 per month, billed $99 yearly
  • 24 Months Plan: $6.21 per month, billed $149 every 2 years
  • One-Time Purchase Lifetime: $249 (One-time payment)
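To compare the plans on an equal footing, the per-month and per-year figures can be recomputed from the billed amounts. This is a small sketch using only the prices listed above; the break-even figure at the end is derived arithmetic, not a number stated by the vendor.

```python
# Billing amounts copied from the list above: (USD billed, months covered).
PLANS = {
    "1 month": (11.95, 1),
    "12 months": (99.00, 12),
    "24 months": (149.00, 24),
}
LIFETIME_PRICE = 249.00


def per_month(billed, months):
    """Effective monthly cost, rounded to cents."""
    return round(billed / months, 2)


def per_year(billed, months):
    """Effective yearly cost, rounded to cents."""
    return round(billed / months * 12, 2)


monthly = {name: per_month(*plan) for name, plan in PLANS.items()}
yearly = {name: per_year(*plan) for name, plan in PLANS.items()}

# Months of the monthly plan that would cost as much as the lifetime license.
break_even_months = LIFETIME_PRICE / PLANS["1 month"][0]

for name in PLANS:
    print(f"{name}: ${monthly[name]:.2f}/month, ${yearly[name]:.2f}/year")
print(f"Lifetime matches the monthly plan after {break_even_months:.1f} months")
```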

160

AnimateDiff

AnimateDiff is an AI tool that can animate personalized text-to-image diffusion models without specific tuning. It is a plug-and-play module that turns most community models into animation generators, without the need for additional training. The tool has been released in four versions: v1, v2, and v3 for Stable Diffusion V1.5, and sdxl-beta for Stable Diffusion XL.

The core of AnimateDiff’s framework is to append a newly initialized motion modeling module to the frozen base text-to-image model and train it on video clips. This allows most existing personalized text-to-image models to be animated, saving the effort of model-specific tuning.
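The idea of training only a new module while the base model stays frozen can be shown with a toy update step. This is a conceptual sketch in plain Python, not AnimateDiff's training code: `ParamGroup`, `sgd_step`, the weights, and the gradients are all hypothetical, and a real implementation would use a deep learning framework.

```python
from dataclasses import dataclass


@dataclass
class ParamGroup:
    name: str
    weights: list
    trainable: bool


def sgd_step(groups, grads, lr=0.1):
    """Apply one gradient step, skipping every frozen parameter group."""
    for group in groups:
        if not group.trainable:
            continue
        group.weights = [
            w - lr * g for w, g in zip(group.weights, grads[group.name])
        ]


base = ParamGroup("base_text_to_image", [0.5, -0.2], trainable=False)
motion = ParamGroup("motion_module", [0.0, 0.0], trainable=True)

# Made-up gradients: both groups receive one, but only the motion module moves.
grads = {"base_text_to_image": [1.0, 1.0], "motion_module": [1.0, -2.0]}
sgd_step([base, motion], grads)

# Plug-and-play: the trained motion module is paired with a different,
# personalized base model without any further training.
personalized = ParamGroup("personalized_text_to_image", [0.7, -0.1], trainable=False)
animation_pipeline = [personalized, motion]
```

Because the base weights never change during training, the motion module does not come to depend on one particular fine-tuned checkpoint, which is what makes the swap in the last two lines plausible.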

Use cases for AnimateDiff include:

  • Text-to-video conversion: Users can input a text prompt and AnimateDiff will generate a corresponding animated video.
  • Animation creation: Users can animate static images, creating dynamic and engaging content.
  • Content creation: Content creators can use AnimateDiff to create unique videos for their audiences.

127

LLMPeople

Interact with 3D models powered by LLMs.

10

Lucid Beads

Introducing Lucid Beads - a unique and purposeful way to create your dream bracelets. With Lucid Beads, you have the power to customize your own gemstone bracelets that not only look stunning but also hold spiritual significance. Each bracelet is handmade to order using natural gemstones and hypoallergenic elastic cord, ensuring both quality and comfort. Plus, you'll receive a free pouch and enjoy free shipping within the US. Lucid Beads also offers a 30-day return policy and the opportunity to earn rewards with every purchase. Whether you're seeking balance, elegance, or a touch of spirituality, Lucid Beads has everything you need to create a truly meaningful bracelet.

Key features of Lucid Beads include:

  • Create fully custom gemstone bracelets with Live 3D
  • Find the perfect gemstones for you with AI Assistant
  • Choose from about 100 all-natural gemstone beads
  • Handcrafted with love and FREE shipping in the U.S.

8

PortraitArt

Introducing PortraitArt: the #1 Photo to Art App that allows you to turn your photos into beautiful works of art! With PortraitArt, you can personalize your photos and transform them into stunning oil paintings, dreamy watercolors, awesome cartoons, cool sketches, elegant vector art, and so much more. Experience the enchanting magic of art with your own pictures, all powered by cutting-edge AI. Try it yourself and see the incredible results!

Key features of PortraitArt include:

  • Transform your photos into 12 different art styles
  • Choose from oil paintings, watercolors, sketches, line art, vector art, illustrations, cartoons, 3D art, vintage styles, vibrant pop art, and more
  • Personalize portraits, family photos, group photos, wedding photos, children photos, house photos, pet photos, car photos, scenery art, and more
  • Receive a notification email once your artwork is complete
  • Easy-to-use interface and quick rendering process

12

AICartoonGenerator

Transform your photos into captivating cartoons with the AI Cartoon Generator! Our innovative tool uses AI technology to magically turn your pictures into various cartoon styles, whether you prefer a realistic look or something cute and funny. Use these cartoon images for your social media profiles or simply enjoy the creative process - all at no cost!

Key features of AI Cartoon Generator include:

  • Revitalize your photos with our AI cartoon picture maker
  • Transform images using our free cartoon filter
  • Create cartoon avatars and characters effortlessly
  • Generate 3D cartoon characters and avatars with ease
  • Make unique profile pictures for social media with our AI tool

14
