Show HN: I extracted the safety filters from Apple Intelligence models

Hacker News - AI
Jul 6, 2025 19:50
BlueFalconHD
hackernews, ai, discussion

Summary

A developer has reverse-engineered the encryption (called "Obfuscation" in the framework) that protects the safety filters used in Apple Intelligence models, and has published the extracted filters in a repository. This opens Apple's AI safety mechanisms to public scrutiny and may inform future research and discussion on AI model safety and security.

I managed to reverse engineer the encryption (referred to as “Obfuscation” in the framework) responsible for managing the safety filters of Apple Intelligence models. I have extracted the filters into a repository. I encourage you to take a look around.

Comments URL: https://news.ycombinator.com/item?id=44483485