


default search action
Senthooran Rajamanoharan
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c4]Javier Ferrando, Oscar Balcells Obeso, Senthooran Rajamanoharan, Neel Nanda:
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models. ICLR 2025
[c3]Subhash Kantamneni, Joshua Engels, Senthooran Rajamanoharan, Max Tegmark, Neel Nanda:
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing. ICML 2025
[i16]Subhash Kantamneni, Joshua Engels, Senthooran Rajamanoharan, Max Tegmark, Neel Nanda:
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing. CoRR abs/2502.16681 (2025)
[i15]Iván Arcuschin, Jett Janiak, Robert Krzyzanowski, Senthooran Rajamanoharan, Neel Nanda, Arthur Conmy:
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful. CoRR abs/2503.08679 (2025)
[i14]Rohin Shah, Alex Irpan, Alexander Matt Turner, Anna Wang, Arthur Conmy, David Lindner, Jonah Brown-Cohen, Lewis Ho, Neel Nanda, Raluca Ada Popa, Rishub Jain, Rory Greig, Samuel Albanie, Scott Emmons, Sebastian Farquhar, Sébastien Krier, Senthooran Rajamanoharan, Sophie Bridgers, Tobi Ijitoye, Tom Everitt, Victoria Krakovna, Vikrant Varma, Vladimir Mikulik, Zachary Kenton, Dave Orr, Shane Legg, Noah D. Goodman, Allan Dafoe, Four Flynn, Anca D. Dragan:
An Approach to Technical AGI Safety and Security. CoRR abs/2504.01849 (2025)
[i13]Bartosz Cywinski, Emil Ryd, Senthooran Rajamanoharan, Neel Nanda:
Towards eliciting latent knowledge from LLMs with mechanistic interpretability. CoRR abs/2505.14352 (2025)
[i12]Edward Turner, Anna Soligo, Mia Taylor, Senthooran Rajamanoharan, Neel Nanda:
Model Organisms for Emergent Misalignment. CoRR abs/2506.11613 (2025)
[i11]Anna Soligo, Edward Turner, Senthooran Rajamanoharan, Neel Nanda:
Convergent Linear Representations of Emergent Misalignment. CoRR abs/2506.11618 (2025)
[i10]Xiaoqing Sun, Alessandro Stolfo, Joshua Engels, Ben Wu, Senthooran Rajamanoharan, Mrinmaya Sachan, Max Tegmark:
Dense SAE Latents Are Features, Not Bugs. CoRR abs/2506.15679 (2025)
[i9]Scott Emmons, Erik Jenner, David K. Elson, Rif A. Saurous, Senthooran Rajamanoharan, Heng Chen, Irhum Shafkat, Rohin Shah:
When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors. CoRR abs/2507.05246 (2025)
[i8]Atticus Wang, Joshua Engels, Oliver Clive-Griffin, Senthooran Rajamanoharan, Neel Nanda:
Simple Mechanistic Explanations for Out-Of-Context Reasoning. CoRR abs/2507.08218 (2025)
[i7]Helena Casademunt, Caden Juang, Adam Karvonen, Samuel Marks, Senthooran Rajamanoharan, Neel Nanda:
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning. CoRR abs/2507.16795 (2025)
[i6]Bartosz Cywinski, Emil Ryd, Rowan Wang, Senthooran Rajamanoharan, Neel Nanda, Arthur Conmy, Samuel Marks:
Eliciting Secret Knowledge from Language Models. CoRR abs/2510.01070 (2025)
[i5]Uzay Macar, Paul C. Bogdan, Senthooran Rajamanoharan, Neel Nanda:
Thought Branches: Interpreting LLM Reasoning Requires Resampling. CoRR abs/2510.27484 (2025)- 2024
[c2]Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah, Neel Nanda:
Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders. NeurIPS 2024
[i4]Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah, Neel Nanda:
Improving Dictionary Learning with Gated Sparse Autoencoders. CoRR abs/2404.16014 (2024)
[i3]Senthooran Rajamanoharan, Tom Lieberum, Nicolas Sonnerat, Arthur Conmy, Vikrant Varma, János Kramár, Neel Nanda:
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders. CoRR abs/2407.14435 (2024)
[i2]Tom Lieberum, Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Nicolas Sonnerat, Vikrant Varma, János Kramár, Anca D. Dragan, Rohin Shah, Neel Nanda:
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2. CoRR abs/2408.05147 (2024)
[i1]Javier Ferrando, Oscar Obeso, Senthooran Rajamanoharan, Neel Nanda:
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models. CoRR abs/2411.14257 (2024)
2000 – 2009
- 2008
[j1]Lars Blackmore, Senthooran Rajamanoharan, Brian C. Williams:
Active Estimation for Jump Markov Linear Systems. IEEE Trans. Autom. Control. 53(10): 2223-2236 (2008)- 2006
[c1]Lars Blackmore, Senthooran Rajamanoharan, Brian C. Williams:
Active Estimation for Switching Linear Dynamic Systems. CDC 2006: 137-144
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-05 23:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







