The company has been steadily releasing research papers on its AI work for years. With Ferret-UI, the company seems to be looking at how AI can help with smartphone navigation: Automating this process ...
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More With little fanfare, researchers from Apple and Columbia University ...
Apple's Ferret LLM could help allow Siri to understand the layout of apps in an iPhone display, potentially increasing the capabilities of Apple's digital assistant. The paper, released by Cornell ...
Artificial intelligence researchers from Apple Inc. and Cornell University quietly unveiled an open-source and multimodal large language model last October known as Ferret, which is said to use parts ...
Apple has revealed its latest development in artificial intelligence (AI) large language model (LLM), introducing the MM1 family of multimodal models capable of interpreting both images and text data.
Meta AI has unveiled the Llama 3.2 model series, a significant milestone in the development of open-source multimodal large language models (LLMs). This series encompasses both vision and text-only ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Transformer-based models have rapidly spread from text to speech, vision, and other modalities. This has created challenges for the development of Neural Processing Units (NPUs). NPUs must now ...