MIT and Nvidia unveil fast hybrid AI image generator

digitaltrends.com

Researchers at MIT and Nvidia have developed a new AI image generation tool called HART. This tool is designed to create images much faster than current technology while using fewer computing resources. HART combines two popular methods of AI image generation: diffusion models and auto-regressive models. The traditional diffusion method produces high-quality images but is slow and requires a lot of computing power. The auto-regressive model is quicker, but it can make more mistakes. By merging these techniques, HART can generate images in just eight steps, compared to the usual two dozen steps. In tests, HART generated an image of a parrot playing a bass guitar in about one second. This is significantly faster than other models, which took around 9 to 10 seconds. HART also needs 31% less computing power, allowing it to run on regular laptops and even mobile phones, unlike many other AI tools that rely on cloud computing. The images created by HART are impressive, with good detail and stylistic variety, matching the quality of leading models. It can produce images with a resolution of 1024 x 1024 pixels. The technology has potential future applications, like combining image generation with language models for more interactive experiences. Despite its fast performance, HART still faces some challenges. It sometimes struggles with certain details, such as correctly depicting numbers or maintaining character consistency. However, these issues are common among AI image generators. Overall, HART shows great promise in the field of AI image creation. Many are looking forward to seeing how MIT and Nvidia might develop this technology further or integrate it into existing products.


With a significance score of 3.8, this news ranks in the top 14% of today's 29036 analyzed articles.

Get summaries of news with significance over 5.5 (usually ~10 stories per week). Read by 9500 minimalists.