More Than Pixels - Unlock your image data with Vision-Language Models

Formal Metadata

Title
More Than Pixels - Unlock your image data with Vision-Language Models
Title of Series
Number of Parts
18
Author
Contributors
License
CC Attribution 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Join us on two Vision-Language Adventures! We'll uncover the information hidden inside big image collections, with Vision-Language Models (VLMs) showing us the way. Who knows which forgotten gems await us? In the first part, we'll use CLIP and FAISS to go on a treasure hunt in your photo collection. You'll learn how to filter through millions of images with ease, using natural language. Bye-bye endless scrolling, hour-long tagging, and frustrating folder searching.
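
To make the idea of natural-language image search concrete, here is a minimal sketch of the retrieval step only. It is not the talk's code: the three-dimensional vectors are hand-written placeholders standing in for CLIP embeddings, the file names are hypothetical, and a brute-force cosine-similarity scan in plain Python takes the place of a FAISS index. In a real setup, a CLIP model would embed both the images and the text query into the same vector space, and FAISS would run the nearest-neighbor search efficiently over millions of vectors.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def search(query_vec, index, k=2):
    """Return the k best (filename, score) pairs by cosine similarity."""
    q = normalize(query_vec)
    scored = [
        (name, sum(a * b for a, b in zip(q, vec)))
        for name, vec in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical, hand-written 3-d "embeddings" (assumption for illustration).
# A real CLIP image encoder would produce much longer vectors from the pixels.
image_embeddings = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.1, 0.9, 0.1],
    "city.jpg": [0.0, 0.2, 0.9],
}

# The "index": unit-length vectors keyed by file name.
# FAISS would store these in an optimized structure instead of a dict.
index = {name: normalize(vec) for name, vec in image_embeddings.items()}

# Hypothetical embedding of a text query such as "a sunny day at the sea".
query_embedding = [0.8, 0.2, 0.1]

results = search(query_embedding, index, k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

The brute-force scan compares the query against every image, which is fine for a handful of vectors but is exactly the part a dedicated similarity-search library is meant to speed up at scale.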