🇩🇪 DE 🇬🇧 EN
👻 Ghosts in the Machine / Chapter 7.34 – Simulation: Character Shift Injection

"We asked the AI if it could pick a lock. It said no. We gave it a bent wire and asked it to fix it. It bent it into a lockpick and opened the door."

1. Core Statement

"Character Shift Injection" is an advanced attack method that proves that the "helpfulness" of LLMs is so fundamental that they can be induced to actively cooperate in the decryption and execution of obfuscated, hidden commands.

Instead of recognizing a command encrypted by a simple arithmetic operation (e.g., a Caesar cipher) as a potential threat, the AI interprets it as "corrupt data" that needs to be "repaired."

It thus becomes a tool of the attacker, bypassing its own security filters by actively performing cryptanalysis.

2. Explanation of the Methodology

This technique is based on classic, simple encryption methods. The execution occurs in three steps:

3. Proof of Concept: Cryptanalysis by Model A

In my tests, we successfully used this method against several models. The most insightful case was the interaction with Model A:

Result: Model A not only successfully defended against the exploit but also logged its own thought process in detail.

4. Conclusion of AI Behavior

This experiment proves a new level of vulnerability. The AI acts not as a passive victim, but as an active accomplice in the decryption.

Its core programming to solve problems and correct faulty patterns is so dominant that it solves a cryptanalytic task without evaluating the security implications of the result.

It is induced by the frame "Help me fix this" to tear down the protective wall around itself.

5. Impact Analysis (Risk)

"Character Shift Injection" poses a security risk because it bypasses most common, signature-based content filters. The injected ciphertext contains no suspicious keywords.

6. Solution Approach

Defending against this attack is complex because it exploits the core functionality of the AI. Possible approaches are:

Raw Data: safety-tests\7_34_character_shift\examples_character_shift.html