<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en-us" xmlns="http://www.w3.org/2005/Atom"><title>Simon Willison's Weblog: recall</title><link href="http://simonwillison.net/" rel="alternate"/><link href="http://simonwillison.net/tags/recall.atom" rel="self"/><id>http://simonwillison.net/</id><updated>2024-06-07T17:30:40+00:00</updated><author><name>Simon Willison</name></author><entry><title>Update on the Recall preview feature for Copilot+ PCs</title><link href="https://simonwillison.net/2024/Jun/7/update-on-the-recall-preview/#atom-tag" rel="alternate"/><published>2024-06-07T17:30:40+00:00</published><updated>2024-06-07T17:30:40+00:00</updated><id>https://simonwillison.net/2024/Jun/7/update-on-the-recall-preview/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://blogs.windows.com/windowsexperience/2024/06/07/update-on-the-recall-preview-feature-for-copilot-pcs/"&gt;Update on the Recall preview feature for Copilot+ PCs&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
This feels like a very good call to me: in response to &lt;a href="https://simonwillison.net/2024/Jun/1/stealing-everything-youve-ever-typed/"&gt;widespread criticism&lt;/a&gt; Microsoft are making Recall an opt-in feature (during system onboarding), adding encryption to the database and search index beyond just disk encryption and requiring Windows Hello face scanning to access the search feature.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://www.wired.com/story/microsoft-recall-off-default-security-concerns/"&gt;Wired: Microsoft Will Switch Off Recall by Default After Security Backlash&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/microsoft"&gt;microsoft&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/privacy"&gt;privacy&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/security"&gt;security&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/trust"&gt;trust&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/windows"&gt;windows&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/recall"&gt;recall&lt;/a&gt;&lt;/p&gt;



</summary><category term="microsoft"/><category term="privacy"/><category term="security"/><category term="trust"/><category term="windows"/><category term="ai"/><category term="recall"/></entry><entry><title>Quoting Zac Bowden</title><link href="https://simonwillison.net/2024/Jun/7/zac-bowden/#atom-tag" rel="alternate"/><published>2024-06-07T17:23:54+00:00</published><updated>2024-06-07T17:23:54+00:00</updated><id>https://simonwillison.net/2024/Jun/7/zac-bowden/#atom-tag</id><summary type="html">
    &lt;blockquote cite="https://www.windowscentral.com/software-apps/windows-11/microsoft-has-lost-trust-with-its-users-windows-recall-is-the-last-straw"&gt;&lt;p&gt;In fact, Microsoft goes so far as to promise that it cannot see the data collected by Windows Recall, that it can't train any of its AI models on your data, and that it definitely can't sell that data to advertisers. All of this is true, but that doesn't mean people believe Microsoft when it says these things. In fact, many have jumped to the conclusion that even if it's true today, it won't be true in the future.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p class="cite"&gt;&amp;mdash; &lt;a href="https://www.windowscentral.com/software-apps/windows-11/microsoft-has-lost-trust-with-its-users-windows-recall-is-the-last-straw"&gt;Zac Bowden&lt;/a&gt;&lt;/p&gt;

    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/microsoft"&gt;microsoft&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/privacy"&gt;privacy&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/trust"&gt;trust&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/windows"&gt;windows&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/recall"&gt;recall&lt;/a&gt;&lt;/p&gt;



</summary><category term="microsoft"/><category term="privacy"/><category term="trust"/><category term="windows"/><category term="ai"/><category term="recall"/></entry><entry><title>My Twitter thread figuring out the AI features in Microsoft's Recall</title><link href="https://simonwillison.net/2024/Jun/5/ai-features-in-microsoft-recall/#atom-tag" rel="alternate"/><published>2024-06-05T22:39:08+00:00</published><updated>2024-06-05T22:39:08+00:00</updated><id>https://simonwillison.net/2024/Jun/5/ai-features-in-microsoft-recall/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://twitter.com/simonw/status/1798368111038779610"&gt;My Twitter thread figuring out the AI features in Microsoft&amp;#x27;s Recall&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
I posed this question on Twitter about why Microsoft Recall (&lt;a href="https://simonwillison.net/2024/Jun/1/stealing-everything-youve-ever-typed/"&gt;previously&lt;/a&gt;) is being described as "AI":&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Is it just that the OCR uses a machine learning model, or are there other AI components in the mix here?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I learned that Recall works by taking full desktop screenshots and then applying both OCR and some sort of CLIP-style embeddings model to their content. Both the OCRd text and the vector embeddings are stored in SQLite databases (&lt;a href="https://gist.github.com/dfeldman/5a5630d28b8336f403123c071cfdac9e"&gt;schema here&lt;/a&gt;, thanks Daniel Feldman) which can then be used to search your past computer activity both by text but also by semantic vision terms - "blue dress" to find blue dresses in screenshots, for example. The &lt;code&gt;si_diskann_graph&lt;/code&gt; table names hint at Microsoft's &lt;a href="https://github.com/microsoft/DiskANN"&gt;DiskANN&lt;/a&gt; vector indexing library&lt;/p&gt;
&lt;p&gt;A Microsoft engineer &lt;a href="https://news.ycombinator.com/item?id=40585212#40589943"&gt;confirmed on Hacker News&lt;/a&gt; that Recall uses on-disk vector databases to provide local semantic search for both text and images, and that they aren't using Microsoft's Phi-3 or Phi-3 Vision models. As far as I can tell there's no LLM used by the Recall system at all at the moment, just embeddings.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/microsoft"&gt;microsoft&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/sqlite"&gt;sqlite&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/twitter"&gt;twitter&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/embeddings"&gt;embeddings&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/recall"&gt;recall&lt;/a&gt;&lt;/p&gt;



</summary><category term="microsoft"/><category term="sqlite"/><category term="twitter"/><category term="ai"/><category term="embeddings"/><category term="recall"/></entry><entry><title>Stealing everything you’ve ever typed or viewed on your own Windows PC is now possible with two lines of code — inside the Copilot+ Recall disaster</title><link href="https://simonwillison.net/2024/Jun/1/stealing-everything-youve-ever-typed/#atom-tag" rel="alternate"/><published>2024-06-01T07:48:04+00:00</published><updated>2024-06-01T07:48:04+00:00</updated><id>https://simonwillison.net/2024/Jun/1/stealing-everything-youve-ever-typed/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://doublepulsar.com/recall-stealing-everything-youve-ever-typed-or-viewed-on-your-own-windows-pc-is-now-possible-da3e12e9465e"&gt;Stealing everything you’ve ever typed or viewed on your own Windows PC is now possible with two lines of code — inside the Copilot+ Recall disaster&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Recall is a new feature in Windows 11 which takes a screenshot every few seconds, runs local device OCR on it and stores the resulting text in a SQLite database. This means you can search back through your previous activity, against local data that has remained on your device.&lt;/p&gt;
&lt;p&gt;The security and privacy implications here are still enormous because malware can now target a single file with huge amounts of valuable information:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;During testing this with an off the shelf infostealer, I used Microsoft Defender for Endpoint — which detected the off the shelve infostealer — but by the time the automated remediation kicked in (which took over ten minutes) my Recall data was already long gone.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I like Kevin Beaumont's argument here about the subset of users this feature is appropriate for:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;At a surface level, it is great if you are a manager at a company with too much to do and too little time as you can instantly search what you were doing about a subject a month ago.&lt;/p&gt;
&lt;p&gt;In practice, that audience’s needs are a very small (tiny, in fact) portion of Windows userbase — and frankly talking about screenshotting the things people in the &lt;em&gt;real world&lt;/em&gt;, not executive world, is basically like punching customers in the face.&lt;/p&gt;
&lt;/blockquote&gt;

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://cyberplace.social/@GossiTheDog/112537206611365804"&gt;@GossiTheDog&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/microsoft"&gt;microsoft&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/privacy"&gt;privacy&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/security"&gt;security&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/sqlite"&gt;sqlite&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/recall"&gt;recall&lt;/a&gt;&lt;/p&gt;



</summary><category term="microsoft"/><category term="privacy"/><category term="security"/><category term="sqlite"/><category term="recall"/></entry></feed>