Apple releases a new open source machine learning model: “Ferret”
What is Ferret? Ferret is a multimodal large-scale language model capable of referring and locating objects based on instructions. This model combines regional representation and visual sampling to achieve precise reference and positioning. It was trained on the GRIT dataset and has an evaluation benchmark named Ferret-Bench. The code and checkpoints for Ferret are available…