Archive for Monday, 15th June 2026

Monday, 15th June 2026

Sighting 5:38 PM — California Brown Pelican, in Monterey Bay National Marine Sanctuary, CA, US, CA

15th Jun 2026

[...] Instead, I picture a specific person and I just write for them. Often this person is "me, but 3 years ago" or a good friend.

— Julia Evans, write for 1 person

# 2:05 am / writing, julia-evans

“They screwed us”: Personality clashes sent Anthropic’s models offline. Lots of "source familiar with the administration's thinking" and "source close to Anthropic" in this Axios piece, which is the best collection of behind-the-scenes gossip I've seen about the US government export control Mythos/Fable story so far.

Logan Graham (I lead the Frontier Red Team at Anthropic), Dave Orr (Head of Safeguards, previously a Director of Engineering at Google DeepMind), and blog favorite Nicholas Carlini are reported to be meeting with the Commerce Department today in D.C. Good luck to them!

(I just noticed Logan was "Special Adviser to the Prime Minister" in the Boris Johnson era, covering AI, science, and technology policy - so significant political experience.)

This closing note doesn't give me much optimism that we'll be getting Fable back any time soon:

The bottom line: One option is to make sure Anthropic's models can't be jailbroken — though perfect jailbreak resistance may be impossible.

Absent that, a source familiar with the administration's thinking said it may simply come down to an attitude fix where, instead of feeling dismissed, "everyone feels safe, secure and happy."

This made me wonder if Anthropic ever successfully addressed the class of attacks described in the Universal and Transferable Adversarial Attacks on Aligned Language Models paper from 2023.

It looks like their Constitutional Classifiers work (that post is from January this year) is relevant to that. They continue to claim that no "universal jailbreak" has been found against Claude Mythos, classifying the jailbreak that triggered the US government response as "a potential narrow, non-universal jailbreak".

# 2:57 pm / jailbreaking, ai, generative-ai, llms, anthropic, claude, nicholas-carlini, ai-ethics, claude-mythos-fable

Release datasette-agent 0.3a0

New tool, execute_write_sql, which requests user approval and then writes to a database - taking user permissions into account. #27

I added a mechanism for asking user approval in datasette agent 0.2a0. The new execute_write_sql tool can now prompt the user for all kinds of useful operations. Here's an example where I add some pelican sightings to my pelican_sightings table:

The new version also enhances the datasette agent chat terminal mode to support approvals, and adds several new options including --unsafe mode for auto-approving them:

datasette agent chat can execute tools that require user approval. #30

Three new options for datasette agent chat - --root to run as root, --yes to approve all ask user questions, and --unsafe for both.

Tools can now provide plain text alternatives to HTML, for display in the datasette agent chat CLI. #31

The datasette agent chat content.db -m gpt-5.5 --unsafe command can now be used to chat directly with a specific database and directly modify it through prompts like "create a notes table", "add a note about X" etc.

15th Jun 2026, 5:19 pm · projects, ai, datasette, annotated-release-notes, generative-ai, llms, llm-tool-use, datasette-agent

Release datasette-apps 0.1a2

Custom network/CSP origins for apps are now guarded by a new apps-set-csp permission, with an optional allowed_csp_origins plugin allow-list for non-privileged users. The Datasette Agent app creation tool enforces the same rules. #24

Stored query picker now supports keyboard navigation and shows the three most recent accessible stored queries when focused.

#fragment links inside apps are no longer intercepted by the external-link confirmation modal. #23

Fixed link confirmation modal and logging panels in ?full=1 full-screen mode. #26

15th Jun 2026, 5:26 pm · datasette, datasette-apps

Release datasette-apps 0.1a3

Fixed a bug where users without the create-app permission could still create apps. #27

Fixed a bug where it was impossible to grant permission to edit an app to users who were not the app's owner. The rules for edit/delete are now the same as view: if the app is private only the owner can modify it, otherwise permission is controlled by Datasette's regular permission system. #29

15th Jun 2026, 8:25 pm · datasette, datasette-apps

← Sunday, 14th June 2026

Tuesday, 16th June 2026 →

Simon Willison’s Weblog

Monday, 15th June 2026