LLM Encoder vs Encoder-Decoder

China's Z.ai claims it trained a model using only Huawei hardware

Chinese outfit Zhipu AI claims it trained a new model entirely using Huawei hardware, and that it’s the first company to ...

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

Apple AI research shows how MLLMs understand, generate, search for images

Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, ...

5d on MSN

GLM-Image explained: Huawei-powered AI that seriously challenges Nvidia, here’s how

For the past few years, a single axiom has ruled the generative AI industry: if you want to build a state-of-the-art model, ...

IEEE

GiVE: Guiding Visual Encoder to Perceive Overlooked Information

Abstract: Multimodal Large Language Models have advanced AI in applications like text-to-video generation and visual question answering. These models rely on visual encoders to convert non-text data ...

GitHub

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Official repository for the paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs". The encoder-free 3D LMM directly utilizes a token embedding module to convert point cloud data ...

IEEE

DE-Unet: Dual-Encoder U-Net for Ultra-High Resolution Remote Sensing Image Segmentation

Abstract: In recent years, there has been a growing demand for remote sensing image semantic segmentation in various applications. The key to semantic segmentation lies in the ability to globally ...

GitHub

nefelibatawht/happy-llm_study

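This project is suitable for university students, researchers, and LLM enthusiasts. Before studying this project, it is recommended that you have some programming experience, especially familiarity with Python ...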
