Albert Gu
2020 – today
- 2024
- [c31] Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting Conformers With Structured State-Space Sequence Models For Online Speech Recognition. ICASSP 2024: 12221-12225
- [c30] Tri Dao, Albert Gu:
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality. ICML 2024
- [c29] Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov:
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling. ICML 2024
- [i34] Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George-Cristian Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando de Freitas, Caglar Gulcehre:
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models. CoRR abs/2402.19427 (2024)
- [i33] Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov:
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling. CoRR abs/2403.03234 (2024)
- [i32] Tri Dao, Albert Gu:
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality. CoRR abs/2405.21060 (2024)
- [i31] Roger Waleffe, Wonmin Byeon, Duncan Riach, Brandon Norick, Vijay Korthikanti, Tri Dao, Albert Gu, Ali Hatamizadeh, Sudhakar Singh, Deepak Narayanan, Garvit Kulshreshtha, Vartika Singh, Jared Casper, Jan Kautz, Mohammad Shoeybi, Bryan Catanzaro:
An Empirical Study of Mamba-based Language Models. CoRR abs/2406.07887 (2024)
- [i30] Sukjun Hwang, Aakash Lahoti, Tri Dao, Albert Gu:
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers. CoRR abs/2407.09941 (2024)
- [i29] Aviv Bick, Kevin Y. Li, Eric P. Xing, J. Zico Kolter, Albert Gu:
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models. CoRR abs/2408.10189 (2024)
- [i28] Ricardo Buitrago Ruiz, Tanya Marwah, Albert Gu, Andrej Risteski:
On the Benefits of Memory for Modeling Time-Dependent PDEs. CoRR abs/2409.02313 (2024)
- 2023
- [c28] Junxiong Wang, Jing Nathan Yan, Albert Gu, Alexander M. Rush:
Pretraining Without Attention. EMNLP (Findings) 2023: 58-69
- [c27] Albert Gu, Isys Johnson, Aman Timalsina, Atri Rudra, Christopher Ré:
How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis Projections. ICLR 2023
- [c26] David M. Knigge, David W. Romero, Albert Gu, Efstratios Gavves, Erik J. Bekkers, Jakub Mikolaj Tomczak, Mark Hoogendoorn, Jan-Jakob Sonke:
Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN. ICLR 2023
- [c25] Antonio Orvieto, Samuel L. Smith, Albert Gu, Anushan Fernando, Çaglar Gülçehre, Razvan Pascanu, Soham De:
Resurrecting Recurrent Neural Networks for Long Sequences. ICML 2023: 26670-26698
- [c24] Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob N. Foerster, Satinder Singh, Feryal M. P. Behbahani:
Structured State Space Models for In-Context Reinforcement Learning. NeurIPS 2023
- [i27] David M. Knigge, David W. Romero, Albert Gu, Efstratios Gavves, Erik J. Bekkers, Jakub M. Tomczak, Mark Hoogendoorn, Jan-Jakob Sonke:
Modelling Long Range Dependencies in N-D: From Task-Specific to a General Purpose CNN. CoRR abs/2301.10540 (2023)
- [i26] Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob N. Foerster, Satinder Singh, Feryal M. P. Behbahani:
Structured State Space Models for In-Context Reinforcement Learning. CoRR abs/2303.03982 (2023)
- [i25] Antonio Orvieto, Samuel L. Smith, Albert Gu, Anushan Fernando, Çaglar Gülçehre, Razvan Pascanu, Soham De:
Resurrecting Recurrent Neural Networks for Long Sequences. CoRR abs/2303.06349 (2023)
- [i24] Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting conformers with structured state space models for online speech recognition. CoRR abs/2309.08551 (2023)
- [i23] Albert Gu, Tri Dao:
Mamba: Linear-Time Sequence Modeling with Selective State Spaces. CoRR abs/2312.00752 (2023)
- 2022
- [c23] Albert Gu, Karan Goel, Christopher Ré:
Efficiently Modeling Long Sequences with Structured State Spaces. ICLR 2022
- [c22] Karan Goel, Albert Gu, Chris Donahue, Christopher Ré:
It's Raw! Audio Generation with State-Space Models. ICML 2022: 7616-7633
- [c21] Ankit Gupta, Albert Gu, Jonathan Berant:
Diagonal State Spaces are as Effective as Structured State Spaces. NeurIPS 2022
- [c20] Albert Gu, Karan Goel, Ankit Gupta, Christopher Ré:
On the Parameterization and Initialization of Diagonal State Space Models. NeurIPS 2022
- [c19] Eric Nguyen, Karan Goel, Albert Gu, Gordon W. Downs, Preey Shah, Tri Dao, Stephen Baccus, Christopher Ré:
S4ND: Modeling Images and Videos as Multidimensional Signals with State Spaces. NeurIPS 2022
- [i22] Karan Goel, Albert Gu, Chris Donahue, Christopher Ré:
It's Raw! Audio Generation with State-Space Models. CoRR abs/2202.09729 (2022)
- [i21] David W. Romero, David M. Knigge, Albert Gu, Erik J. Bekkers, Efstratios Gavves, Jakub M. Tomczak, Mark Hoogendoorn:
Towards a General Purpose CNN for Long Range Dependencies in ND. CoRR abs/2206.03398 (2022)
- [i20] Albert Gu, Ankit Gupta, Karan Goel, Christopher Ré:
On the Parameterization and Initialization of Diagonal State Space Models. CoRR abs/2206.11893 (2022)
- [i19] Albert Gu, Isys Johnson, Aman Timalsina, Atri Rudra, Christopher Ré:
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections. CoRR abs/2206.12037 (2022)
- [i18] Eric Nguyen, Karan Goel, Albert Gu, Gordon W. Downs, Preey Shah, Tri Dao, Stephen A. Baccus, Christopher Ré:
S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces. CoRR abs/2210.06583 (2022)
- [i17] Junxiong Wang, Jing Nathan Yan, Albert Gu, Alexander M. Rush:
Pretraining Without Attention. CoRR abs/2212.10544 (2022)
- 2021
- [c18] Karan Goel, Albert Gu, Yixuan Li, Christopher Ré:
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation. ICLR 2021
- [c17] Ines Chami, Albert Gu, Dat Nguyen, Christopher Ré:
HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections. ICML 2021: 1419-1429
- [c16] Jared Quincy Davis, Albert Gu, Krzysztof Choromanski, Tri Dao, Christopher Ré, Chelsea Finn, Percy Liang:
Catformer: Designing Stable Transformers via Sensitivity Analysis. ICML 2021: 2489-2499
- [c15] Albert Gu, Isys Johnson, Karan Goel, Khaled Saab, Tri Dao, Atri Rudra, Christopher Ré:
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers. NeurIPS 2021: 572-585
- [i16] Ines Chami, Albert Gu, Dat Nguyen, Christopher Ré:
HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections. CoRR abs/2106.03306 (2021)
- [i15] Albert Gu, Isys Johnson, Karan Goel, Khaled Saab, Tri Dao, Atri Rudra, Christopher Ré:
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers. CoRR abs/2110.13985 (2021)
- [i14] Albert Gu, Karan Goel, Christopher Ré:
Efficiently Modeling Long Sequences with Structured State Spaces. CoRR abs/2111.00396 (2021)
- 2020
- [c14] Anna C. Gilbert, Albert Gu, Christopher Ré, Atri Rudra, Mary Wootters:
Sparse Recovery for Orthogonal Polynomial Transforms. ICALP 2020: 58:1-58:16
- [c13] Tri Dao, Nimit Sharad Sohoni, Albert Gu, Matthew Eichhorn, Amit Blonder, Megan Leszczynski, Atri Rudra, Christopher Ré:
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps. ICLR 2020
- [c12] Albert Gu, Çaglar Gülçehre, Thomas Paine, Matt Hoffman, Razvan Pascanu:
Improving the Gating Mechanism of Recurrent Neural Networks. ICML 2020: 3800-3809
- [c11] Ines Chami, Albert Gu, Vaggos Chatziafratis, Christopher Ré:
From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical Clustering. NeurIPS 2020
- [c10] Albert Gu, Tri Dao, Stefano Ermon, Atri Rudra, Christopher Ré:
HiPPO: Recurrent Memory with Optimal Polynomial Projections. NeurIPS 2020
- [c9] Nimit Sharad Sohoni, Jared Dunnmon, Geoffrey Angus, Albert Gu, Christopher Ré:
No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems. NeurIPS 2020
- [i13] Karan Goel, Albert Gu, Yixuan Li, Christopher Ré:
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation. CoRR abs/2008.06775 (2020)
- [i12] Albert Gu, Tri Dao, Stefano Ermon, Atri Rudra, Christopher Ré:
HiPPO: Recurrent Memory with Optimal Polynomial Projections. CoRR abs/2008.07669 (2020)
- [i11] Ines Chami, Albert Gu, Vaggos Chatziafratis, Christopher Ré:
From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical Clustering. CoRR abs/2010.00402 (2020)
- [i10] Nimit Sharad Sohoni, Jared A. Dunnmon, Geoffrey Angus, Albert Gu, Christopher Ré:
No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems. CoRR abs/2011.12945 (2020)
- [i9] Tri Dao, Nimit Sharad Sohoni, Albert Gu, Matthew Eichhorn, Amit Blonder, Megan Leszczynski, Atri Rudra, Christopher Ré:
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps. CoRR abs/2012.14966 (2020)
2010 – 2019
- 2019
- [c8] Albert Gu, Frederic Sala, Beliz Gunel, Christopher Ré:
Learning Mixed-Curvature Representations in Product Spaces. ICLR (Poster) 2019
- [c7] Tri Dao, Albert Gu, Matthew Eichhorn, Atri Rudra, Christopher Ré:
Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations. ICML 2019: 1517-1527
- [c6] Tri Dao, Albert Gu, Alexander Ratner, Virginia Smith, Chris De Sa, Christopher Ré:
A Kernel Theory of Modern Data Augmentation. ICML 2019: 1528-1537
- [i8] Tri Dao, Albert Gu, Matthew Eichhorn, Atri Rudra, Christopher Ré:
Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations. CoRR abs/1903.05895 (2019)
- [i7] Anna C. Gilbert, Albert Gu, Christopher Ré, Atri Rudra, Mary Wootters:
Sparse Recovery for Orthogonal Polynomial Transforms. CoRR abs/1907.08362 (2019)
- [i6] Albert Gu, Çaglar Gülçehre, Tom Le Paine, Matthew W. Hoffman, Razvan Pascanu:
Improving the Gating Mechanism of Recurrent Neural Networks. CoRR abs/1910.09890 (2019)
- 2018
- [c5] Anna T. Thomas, Albert Gu, Tri Dao, Atri Rudra, Christopher Ré:
Learning Invariance with Compact Transforms. ICLR (Workshop) 2018
- [c4] Frederic Sala, Christopher De Sa, Albert Gu, Christopher Ré:
Representation Tradeoffs for Hyperbolic Embeddings. ICML 2018: 4457-4466
- [c3] Anna T. Thomas, Albert Gu, Tri Dao, Atri Rudra, Christopher Ré:
Learning Compressed Transforms with Low Displacement Rank. NeurIPS 2018: 9066-9078
- [c2] Christopher De Sa, Albert Gu, Rohan Puttagunta, Christopher Ré, Atri Rudra:
A Two-pronged Progress in Structured Dense Matrix Vector Multiplication. SODA 2018: 1060-1079
- [i5] Tri Dao, Albert Gu, Alexander J. Ratner, Virginia Smith, Christopher De Sa, Christopher Ré:
A Kernel Theory of Modern Data Augmentation. CoRR abs/1803.06084 (2018)
- [i4] Christopher De Sa, Albert Gu, Christopher Ré, Frederic Sala:
Representation Tradeoffs for Hyperbolic Embeddings. CoRR abs/1804.03329 (2018)
- [i3] Anna T. Thomas, Albert Gu, Tri Dao, Atri Rudra, Christopher Ré:
Learning Compressed Transforms with Low Displacement Rank. CoRR abs/1810.02309 (2018)
- 2016
- [j2] Albert Gu, Anupam Gupta, Amit Kumar:
The Power of Deferral: Maintaining a Constant-Competitive Steiner Tree Online. SIAM J. Comput. 45(1): 1-28 (2016)
- [i2] Albert Gu, Rohan Puttagunta, Christopher Ré, Atri Rudra:
Recurrence Width for Structured Dense Matrix Vector Multiplication. CoRR abs/1611.01569 (2016)
- 2015
- [j1] Albert Gu:
Sprague-Grundy Values of the $\mathcal{R}$-Wythoff Game. Electron. J. Comb. 22(2): 2 (2015)
- 2013
- [c1] Albert Gu, Anupam Gupta, Amit Kumar:
The power of deferral: maintaining a constant-competitive steiner tree online. STOC 2013: 525-534
- [i1] Albert Gu, Anupam Gupta, Amit Kumar:
The Power of Deferral: Maintaining a Constant-Competitive Steiner Tree Online. CoRR abs/1307.3757 (2013)
last updated on 2024-10-07 21:22 CEST by the dblp team
all metadata released as open data under CC0 1.0 license