Stealing Part of a Production Language Model (2024)
We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI’s ChatGPT or Google’s PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under $20 USD, our attack extracts the entire projection matrix of OpenAI’s ada and babbage language models. We thereby confirm, for the first time, that these black-box … ⌘ Read more
N3694: Functions with Data - Closures in C (A Comprehensive Proposal Overviewing Blocks, Nested Functions, and Lambdas)
Comments ⌘ Read more
Python 3.14.0 (final) is here
This is the stable release of Python 3.14.0
Python 3.14.0, the newest major release of the Python programming language is here!
AI and Home-Cooked Software
https://mrkaran.dev/posts/ai-home-cooked-software/
What are you doing this week?
What are you doing this week? Feel free to share!
Keep in mind it’s OK to do nothing at all, too. ⌘ Read more
A case for learning GPU programming with a compute-first mindset – Maister’s Graphics Adventures
Comments ⌘ Read more