Why screenshots could be a crucial tool for making AI work

If you want to get the most out of a world full of artificial intelligence tools, here is a tip: start taking screenshots. Lots of screenshots. Of anything and everything. Because for all the talk about voice modes, cameras everywhere, and the multimodal future of everything, there may be no more valuable digital behavior than pressing the buttons and saving what you are looking at.
Screenshots are the most universal way to capture digital information. You can capture anything (well, almost anything; thanks a lot, Netflix!) with a few clicks, then save it and share it with almost any device, app, or person. “It is a portable data format,” says Johnny Berri, founder of the digital storage app Fabric. “There is nothing else that you can move between any program.”
A screenshot contains a lot of information, such as its source, its contents, and even the time of day in the corner of the screen. Most important of all, it sends a decisive and complex signal: it says, I care about this. We have countless new AI tools that aim to watch the world, our lives, and everything else, and try to make sense of it all for us. These tools often fall short for many reasons, but mostly because AI is very good at knowing things and terrible at knowing whether they matter. A screenshot supplies that sense of importance and tells the system what it needs to pay attention to.
Screenshots also put you, the user, in control in an important way. “If I give you access to all my emails, all my WhatsApps, everything, there is a lot of noise,” says Mattias Deserti, head of smartphone marketing at Nothing. Simply put, there is no reason to save every email you receive or every web page you visit, and that is to say nothing of the privacy implications. “So what if, instead, you could start training the system yourself, and feed the system the information you want it to know about you?” Instead of a tool like Microsoft Recall, which asks for unlimited access to everything, starting with screenshots lets you choose what you share.
To date, screenshots have been a fairly blunt tool. You take one, and it gets saved to your camera roll, where you are likely to forget about it until the end of time. (And do not get me started on all the screenshots I take by accident, most of them of my lock screen.) At best, you may be able to search for some of the text inside the image. But more likely you will just have to scroll until you find it again.
The first step in making screenshots more useful is to know what is actually in them. This is, at first blush, remarkably uncomplicated: optical character recognition has long done a good job of detecting the text on a page. AI models take this a step further, so that you can search for a title, or just “movies,” to find all your screenshots of posters, Fandango results, TikTok recommendations, and more. “We use OCR,” says Shamaz Zack, a product manager at Google and part of the team behind the Pixel Screenshots app. “Then we use a model to detect the entity, then Gemini to understand the actual context of the screen.”
See, there is much more to a screenshot than just the text inside it. The right AI model should be able to tell that an image came from WhatsApp just from that particular shade of green. It should be able to identify a website by its header logo, or understand when you are saving the name of a Spotify song, a listing for a handyman, or an Amazon listing. Armed with this information, a screenshot app could start organizing all of these images automatically. And even this is just the beginning.
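The three stages Zack describes (text extraction, entity detection, context understanding) can be sketched roughly as follows. This is a toy illustration only, not how Pixel Screenshots or any real app works: the OCR text is assumed to be already extracted, and the keyword table, the `Screenshot` class, and every function name here are hypothetical stand-ins for trained models.

```python
import re
from dataclasses import dataclass, field

# Hypothetical keyword lists; a real system would use a trained model here.
CATEGORY_KEYWORDS = {
    "movies": {"showtimes", "fandango", "trailer", "cinema"},
    "music": {"spotify", "album", "playlist", "concert"},
    "travel": {"boarding", "flight", "gate"},
}

@dataclass
class Screenshot:
    filename: str
    ocr_text: str                                  # stage 1 output, assumed already extracted
    entities: list = field(default_factory=list)   # filled in by stage 2
    category: str = "uncategorized"                # filled in by stage 3

def detect_entities(text):
    """Stage 2 stand-in: pull out dates like 2025-06-14 and times like 7:30 PM."""
    dates = re.findall(r"\b\d{4}-\d{2}-\d{2}\b", text)
    times = re.findall(r"\b\d{1,2}:\d{2}\s?(?:AM|PM)\b", text)
    return dates + times

def classify(text):
    """Stage 3 stand-in: choose the category with the most keyword hits."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    best, best_hits = "uncategorized", 0
    for category, keywords in CATEGORY_KEYWORDS.items():
        hits = len(words & keywords)
        if hits > best_hits:
            best, best_hits = category, hits
    return best

def process(shot):
    """Run stages 2 and 3 on a screenshot whose text is already extracted."""
    shot.entities = detect_entities(shot.ocr_text)
    shot.category = classify(shot.ocr_text)
    return shot

def search(shots, category):
    """Return the filenames of all screenshots filed under a category."""
    return [s.filename for s in shots if s.category == category]
```

With a few processed screenshots in a list, `search(shots, "movies")` returns the filenames whose text mentioned things like Fandango or showtimes, which is the "just search for movies" behavior described above in miniature.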
With everything I have described so far, all we have really created is a very good app for looking at your screenshots, which no one really thinks is a good idea, because it would just be one more thing to check, or forget to check. Where it gets much more interesting is when your device or your app can start using screenshots on your behalf, to help you remember what you captured or even use that information to get things done.
In Nothing's new Essential Space app, for example, the app can create reminders based on the things you save. If you take a screenshot of a concert you want to go to, it can remind you about it automatically. Pixel Screenshots pushes the idea further: if you save a concert listing, your Pixel can prompt you to listen to that band the next time you open Spotify. If you screenshot an ID card or a boarding pass, it may ask you to put it in the wallet app. Zack says the idea is to think of screenshots as an input system for everything else.
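The "input system" idea can be pictured as a small rule table that maps the kind of thing captured to a suggested follow-up. The sketch below is purely illustrative: the kind labels, the action wording, and the `Capture` and `suggest_action` names are all hypothetical, and it assumes an earlier step has already worked out what the screenshot shows.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical rule table: screenshot kind -> suggested follow-up.
# The kinds and wording are illustrative, not taken from any real app.
ACTION_TEMPLATES = {
    "concert": "Set a reminder for {title}",
    "boarding_pass": "Add {title} to the wallet app",
    "id_card": "Add {title} to the wallet app",
}

@dataclass
class Capture:
    kind: str    # label produced by an earlier classification step (assumed)
    title: str   # short description of what was captured

def suggest_action(capture: Capture) -> Optional[str]:
    """Return a suggested action, or None when no rule applies."""
    template = ACTION_TEMPLATES.get(capture.kind)
    if template is None:
        return None
    return template.format(title=capture.title)
```

A capture with no matching rule, such as a meme, simply yields no suggestion, which reflects the point that not every screenshot should trigger something.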
Mike Choi, an independent developer, built an app called Camp in part to help him make better use of his own screenshots. He started by turning each screenshot into a “card,” storing the salient information alongside the image. “You have a screenshot, and below it there is a button that flips the card,” he says. “It shows you a map, if it is a location; a preview of a song, if it is a song. The idea was, given an endless collection of different types of screenshots, can AI create the perfect user interface for that category on the fly?”
If all of this sounds familiar, it is because there is another term for what is going on here: it is called agentic AI. Every technology company appears to be working on ways to use AI to get things done on your behalf. Only in this case, you do not have to write long prompts or chat back and forth with an assistant. You just take a screenshot and let the system go to work. “You are building a knowledge base, when today that knowledge base is limited to your gallery and nothing happens with it,” says Deserti. He is excited to get to the point where you can screenshot a concert date and Essential Space automatically prompts you to buy tickets when they go on sale.
Understanding screenshots is not always straightforward. Some you want to keep forever, such as the ID card you may need often; others, such as a concert poster or a parking pass, have a very limited life. For that matter, how is an app supposed to distinguish between the parking pass you use every day at work and the one you used once at the airport and will never need again? Some of the screenshots on my phone were sent to me on WhatsApp; others are Instagram memes I grabbed to send to friends. The camera roll was never designed to handle all of these, and the same goes for screenshot apps. Many of these screenshot apps are looking for ways to prompt you to add a note, or organize things yourself, in order to give the system some additional useful information. But it is difficult to do this without destroying what makes screenshots so smooth and easy in the first place.
One way to start solving this problem is to make screenshots more useful by collecting some additional context from your device. This is where companies such as Google have an advantage: because they make the device, they can see everything that is happening when you take a screenshot. If you screenshot your web browser, they can also store the link you were looking at. They can also note your physical location, or the time and the weather. Sometimes all of this is useful, and sometimes it is nonsense. The more data they collect, the greater the risk that these apps run into the same noise problem that screenshots helped solve in the first place.
But the input system works. We all take screenshots, all the time, and we are used to taking them as a way to flag many kinds of useful information. Getting access to this type of relevant, personal data is the hardest part of building a great AI assistant. The future of computing is multimodal, spanning cameras, microphones, and sensors of all kinds. But the first good way to use AI may be one screenshot at a time.