On the Complexity of Neural Computation in Superposition

Adler, Micah; Shavit, Nir

dc.contributor.author	Adler, Micah
dc.contributor.author	Shavit, Nir
dc.date.accessioned	2024-09-30T15:49:30Z
dc.date.available	2024-09-30T15:49:30Z
dc.date.issued	2024-09-30
dc.identifier.uri	https://hdl.handle.net/1721.1/157073
dc.description.abstract	Recent advances in the understanding of neural networks suggest that superposition, the ability of a single neuron to represent multiple features simultaneously, is a key mechanism underlying the computational efficiency of large-scale networks. This paper explores the theoretical foundations of computing in superposition, focusing on explicit, provably correct algorithms and their efficiency. We present the first lower bounds showing that for a broad class of problems, including permutations and pairwise logical operations, a neural net- work computing in superposition requires at least Ω(m′ log m′) parameters and Ω(√(m′ log m′)) neurons, where m′ is the number of output features being computed. This implies that any “lottery ticket” sparse sub-network must have at least Ω(m′ log m′ ) parameters no matter what the initial dense network size. Conversely, we show a nearly tight upper bound: logical operations like pair- wise AND can be computed using O(√(m′) log m′) neurons and O(m′ log^2 m′) parameters. There is thus an exponential gap between computing in superposition, the subject of this work, and representing features in superposition, which can require as little as O(log m′) neurons based on the Johnson-Lindenstrauss Lemma. Our hope is that our results open a path for using complexity theoretic techniques in neural network interpretability research.	en_US
dc.language.iso	en_US	en_US
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/us/	*
dc.subject	superposition	en_US
dc.subject	neural network	en_US
dc.subject	neurons	en_US
dc.subject	complexity	en_US
dc.title	On the Complexity of Neural Computation in Superposition	en_US
dc.type	Article	en_US

Files in this item

Name:: license_rdf
Size:: 811bytes
Format:: application/rdf+xml

View/Open

Name:: Superposition.pdf
Size:: 816.5Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

CSAIL Technical Reports (July 1, 2003 - present)

Show simple item record