


default search action
Matei Zaharia
Matei A. Zaharia
Person information
- affiliation: Stanford University, CA, USA
- award (2019): Presidential Early Career Award for Scientists and Engineers
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j43]Albert J. Rogers, Neal K. Bhatia, Sabyasachi Bandyopadhyay, James E. Tooley, Rayan Ansari, Vyom Thakkar, Justin Xu, Jessica Torres Soto, Jagteshwar S. Tung, Mahmood I. Alhusseini, Paul Clopton, Reza Sameni, Gari D. Clifford, J. Weston Hughes, Euan A. Ashley, Marco V. Perez, Matei Zaharia, Sanjiv M. Narayan:
Identification of cardiac wall motion abnormalities in diverse populations by deep learning of the electrocardiogram. npj Digit. Medicine 8(1) (2025) - 2024
- [j42]Liana Patel
, Peter Kraft
, Carlos Guestrin
, Matei Zaharia
:
ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data. Proc. ACM Manag. Data 2(3): 120 (2024) - [j41]Maryann Xue, Yingyi Bu, Abhishek Somani, Wenchen Fan, Ziqi Liu, Steven Chen, Herman Van Hovell, Bart Samwel, Mostafa Mokhtar, Rk Korlapati, Andy Lam, Yunxiao Ma, Vuk Ercegovac, Jiexing Li, Alexander Behm, Yuanjian Li, Xiao Li, Sriram Krishnamurthy, Amit Shukla, Michalis Petropoulos, Sameer Paranjpye, Reynold Xin, Matei Zaharia:
Adaptive and Robust Query Execution for Lakehouses At Scale. Proc. VLDB Endow. 17(12): 3947-3959 (2024) - [c107]Krista Opsahl-Ong, Michael J. Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab:
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs. EMNLP 2024: 9340-9366 - [c106]Keshav Santhanam
, Deepti Raghavan
, Muhammad Shahir Rahman
, Thejas Venkatesh
, Neha Kunjal
, Pratiksha Thaker
, Philip Alexander Levis, Matei Zaharia
:
ALTO: An Efficient Network Orchestrator for Compound AI Systems. EuroMLSys@EuroSys 2024: 117-125 - [c105]Hao Liu, Matei Zaharia, Pieter Abbeel:
RingAttention with Blockwise Transformers for Near-Infinite Context. ICLR 2024 - [c104]Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts:
DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines. ICLR 2024 - [c103]Jiwon Park
, Shadaj Laddad
, Dev Bali
, Wen Zhang
, Scott Shenker
, Matei Zaharia
:
Everything Everywhere All At Once: Efficient Cross-Service Program Analysis with OverSeer. ASE Workshops 2024: 82-87 - [c102]Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia:
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems. NAACL-HLT 2024: 338-354 - [c101]Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei A. Zaharia, James Y. Zou:
Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems. NeurIPS 2024 - [c100]Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia, Tatsunori Hashimoto:
Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks. SP (Workshops) 2024: 132-143 - [i92]Hao Liu, Wilson Yan, Matei Zaharia, Pieter Abbeel:
World Model on Million-Length Video And Language With Blockwise RingAttention. CoRR abs/2402.08268 (2024) - [i91]Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou:
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems. CoRR abs/2403.02419 (2024) - [i90]Keshav Santhanam, Deepti Raghavan, Muhammad Shahir Rahman, Thejas Venkatesh, Neha Kunjal, Pratiksha Thaker, Philip Alexander Levis, Matei Zaharia:
ALTO: An Efficient Network Orchestrator for Compound AI Systems. CoRR abs/2403.04311 (2024) - [i89]Liana Patel, Peter Kraft, Carlos Guestrin, Matei Zaharia:
ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data. CoRR abs/2403.04871 (2024) - [i88]Shu Liu, Asim Biswal, Audrey Cheng, Xiangxi Mo, Shiyi Cao, Joseph E. Gonzalez, Ion Stoica, Matei Zaharia:
Optimizing LLM Queries in Relational Workloads. CoRR abs/2403.05821 (2024) - [i87]Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez:
RAFT: Adapting Language Model to Domain Specific RAG. CoRR abs/2403.10131 (2024) - [i86]Karim Elmaaroufi, Devan Shanker, Ana Cismaru, Marcell Vazquez-Chanlatte, Alberto L. Sangiovanni-Vincentelli, Matei Zaharia, Sanjit A. Seshia:
Generating Probabilistic Scenario Programs from Natural Language. CoRR abs/2405.03709 (2024) - [i85]Krista Opsahl-Ong, Michael J. Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab:
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs. CoRR abs/2406.11695 (2024) - [i84]Liana Patel, Siddharth Jha, Carlos Guestrin, Matei Zaharia:
LOTUS: Enabling Semantic Queries with LLMs Over Tables of Unstructured and Structured Data. CoRR abs/2407.11418 (2024) - [i83]Jared Quincy Davis, Boris Hanin, Lingjiao Chen, Peter Bailis, Ion Stoica, Matei Zaharia:
Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design. CoRR abs/2407.16831 (2024) - [i82]Asim Biswal, Liana Patel, Siddarth Jha, Amog Kamsetty, Shu Liu, Joseph E. Gonzalez, Carlos Guestrin, Matei Zaharia:
Text2SQL is Not Enough: Unifying AI and Databases with TAG. CoRR abs/2408.14717 (2024) - [i81]Wilson Yan, Matei Zaharia, Volodymyr Mnih, Pieter Abbeel, Aleksandra Faust, Hao Liu:
ElasticTok: Adaptive Tokenization for Image and Video. CoRR abs/2410.08368 (2024) - [i80]Quinn Leng, Jacob Portes, Sam Havens, Matei Zaharia, Michael Carbin:
Long Context RAG Performance of Large Language Models. CoRR abs/2411.03538 (2024) - [i79]Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica:
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. CoRR abs/2411.11217 (2024) - [i78]Mathew Jacob, Erik Lindgren, Matei Zaharia, Michael Carbin, Omar Khattab, Andrew Drozdov:
Drowning in Documents: Consequences of Scaling Reranker Inference. CoRR abs/2411.11767 (2024) - [i77]Ion Stoica, Matei Zaharia, Joseph Gonzalez, Ken Goldberg, Koushik Sen, Hao Zhang, Anastasios Angelopoulos, Shishir G. Patil, Lingjiao Chen, Wei-Lin Chiang, Jared Quincy Davis:
Specifications: The missing link to making the development of LLM systems an engineering discipline. CoRR abs/2412.05299 (2024) - [i76]Aditya Desai, Shuo Yang, Alejandro Cuadron, Ana Klimovic, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica:
HashAttention: Semantic Sparsity for Faster Inference. CoRR abs/2412.14468 (2024) - [i75]Jinhao Zhu, Liana Patel, Matei Zaharia, Raluca Ada Popa:
Compass: Encrypted Semantic Search with High Accuracy. IACR Cryptol. ePrint Arch. 2024: 1255 (2024) - 2023
- [j40]Peter Kraft, Qian Li, Xinjing Zhou, Peter Bailis, Michael Stonebraker, Xiangyao Yu, Matei Zaharia
:
Epoxy: ACID Transactions Across Diverse Data Stores. Proc. VLDB Endow. 16(11): 2742-2754 (2023) - [j39]Matthew Russo, Tatsunori Hashimoto, Daniel Kang
, Yi Sun, Matei Zaharia
:
Accelerating Aggregation Queries on Unstructured Streams of Data. Proc. VLDB Endow. 16(11): 2897-2910 (2023) - [j38]Qian Li, Peter Kraft, Michael J. Cafarella, Çagatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia
:
R3: Record-Replay-Retroaction for Database-Backed Applications. Proc. VLDB Endow. 16(11): 3085-3097 (2023) - [c99]Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Huamin Qu, Christopher Ré, Matei Zaharia, James Zou:
HAPI Explorer: Comprehension, Discovery, and Explanation on History of ML APIs. AAAI 2023: 16416-16418 - [c98]Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avi Sil, Radu Florian, Md. Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts:
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking. ACL (Findings) 2023: 11613-11628 - [c97]Paras Jain, Peter Kraft, Conor Power, Tathagata Das, Ion Stoica, Matei Zaharia:
Analyzing and Comparing Lakehouse Storage Systems. CIDR 2023 - [c96]Qian Li, Peter Kraft, Michael J. Cafarella, Çagatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia:
Transactions Make Debugging Easy. CIDR 2023 - [c95]Pratiksha Thaker, Matei Zaharia
, Tatsunori Hashimoto:
Congestion Control Safety via Comparative Statics. INFOCOM 2023: 1-10 - [c94]Trevor Gale, Deepak Narayanan, Cliff Young, Matei Zaharia:
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts. MLSys 2023 - [c93]Deepti Raghavan
, Shreya Ravi
, Gina Yuan
, Pratiksha Thaker
, Sanjari Srivastava
, Micah Murray
, Pedro Henrique Penna
, Amy Ousterhout
, Philip Alexander Levis, Matei Zaharia
, Irene Zhang
:
Cornflakes: Zero-Copy Serialization for Microsecond-Scale Networking. SOSP 2023: 200-215 - [i74]Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia
, Tatsunori Hashimoto:
Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks. CoRR abs/2302.05733 (2023) - [i73]Francisco Romero, Caleb Winston, Johann Hauswald, Matei Zaharia
, Christos Kozyrakis:
Zelda: Video Analytics using Vision-Language Models. CoRR abs/2305.03785 (2023) - [i72]Lingjiao Chen, Matei Zaharia
, James Zou:
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance. CoRR abs/2305.05176 (2023) - [i71]Lingjiao Chen, Matei Zaharia
, James Zou:
How is ChatGPT's behavior changing over time? CoRR abs/2307.09009 (2023) - [i70]Matthew Russo, Tatsunori Hashimoto, Daniel Kang, Yi Sun, Matei Zaharia
:
Accelerating Aggregation Queries on Unstructured Streams of Data. CoRR abs/2308.09157 (2023) - [i69]Hao Liu, Matei Zaharia
, Pieter Abbeel:
Ring Attention with Blockwise Transformers for Near-Infinite Context. CoRR abs/2310.01889 (2023) - [i68]Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia
, Christopher Potts:
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines. CoRR abs/2310.03714 (2023) - [i67]Hao Liu, Matei Zaharia
, Pieter Abbeel:
Exploration with Principles for Diverse AI Supervision. CoRR abs/2310.08899 (2023) - [i66]Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia
:
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems. CoRR abs/2311.09476 (2023) - [i65]Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia
, James Zou:
Data Acquisition: A New Frontier in Data-centric AI. CoRR abs/2311.13712 (2023) - [i64]Zhiling Zheng, Zhiguo He, Omar Khattab, Nakul Rampal, Matei A. Zaharia, Christian Borgs, Jennifer T. Chayes, Omar M. Yaghi:
Image and Data Mining in Reticular Chemistry Using GPT-4V. CoRR abs/2312.05468 (2023) - [i63]Arnav Singhvi, Manish Shetty, Shangyin Tan, Christopher Potts, Koushik Sen, Matei Zaharia, Omar Khattab:
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines. CoRR abs/2312.13382 (2023) - 2022
- [j37]Mihai Budiu, Pratiksha Thaker
, Parikshit Gopalan, Udi Wieder, Matei Zaharia
:
Overlook: Differentially Private Exploratory Visualization for Big Data. J. Priv. Confidentiality 12(1) (2022) - [j36]Akshay Agrawal, Stephen P. Boyd, Deepak Narayanan, Fiodar Kazhamiaka, Matei Zaharia
:
Allocation of fungible resources via a fast, scalable price discovery method. Math. Program. Comput. 14(3): 593-622 (2022) - [j35]Weixin Liang, Girmaw Abebe Tadesse
, Daniel E. Ho
, Li Fei-Fei, Matei Zaharia
, Ce Zhang, James Zou
:
Advances, challenges and opportunities in creating data for trustworthy AI. Nat. Mach. Intell. 4(8): 669-677 (2022) - [j34]Weixin Liang, Girmaw Abebe Tadesse
, Daniel E. Ho, Li Fei-Fei, Matei Zaharia
, Ce Zhang, James Zou
:
Author Correction: Advances, challenges and opportunities in creating data for trustworthy AI. Nat. Mac. Intell. 4(10): 904 (2022) - [j33]Magdalena Balazinska, Surajit Chaudhuri, AnHai Doan, Joseph M. Hellerstein, Hanuma Kodavalla, Ippokratis Pandis, Matei Zaharia
:
Cloud Data Systems: What are the Opportunities for the Database Research Community? Proc. VLDB Endow. 15(12): 3826-3827 (2022) - [j32]Francisco Romero, Johann Hauswald, Aditi Partap, Daniel Kang
, Matei Zaharia
, Christos Kozyrakis:
Optimizing Video Analytics with Declarative Model Relationships. Proc. VLDB Endow. 16(3): 447-460 (2022) - [j31]Nirvik Baruah, Peter Kraft, Fiodar Kazhamiaka, Peter Bailis, Matei Zaharia
:
Parallelism-Optimizing Data Placement for Faster Data-Parallel Computations. Proc. VLDB Endow. 16(4): 760-771 (2022) - [c92]Cody Coleman, Edward Chou, Julian Katz-Samuels, Sean Culatana, Peter Bailis, Alexander C. Berg, Robert D. Nowak, Roshan Sumbaly, Matei Zaharia, I. Zeki Yalniz:
Similarity Search for Efficient Active Learning and Search of Rare Concepts. AAAI 2022: 6402-6410 - [c91]Qian Li, Peter Kraft, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Jason Li, Michael J. Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia:
A Progress Report on DBOS: A Database-oriented Operating System. CIDR 2022 - [c90]Daniel Kang, Francisco Romero, Peter D. Bailis, Christos Kozyrakis, Matei Zaharia:
VIVA: An End-to-End System for Interactive Video Analytics. CIDR 2022 - [c89]Keshav Santhanam, Omar Khattab, Christopher Potts, Matei Zaharia
:
PLAID: An Efficient Engine for Late Interaction Retrieval. CIKM 2022: 1747-1756 - [c88]Lingjiao Chen, Matei Zaharia, James Zou:
How Did the Model Change? Efficiently Assessing Machine Learning API Shifts. ICLR 2022 - [c87]Ashwin Paranjape, Omar Khattab, Christopher Potts, Matei Zaharia, Christopher D. Manning:
Hindsight: Posterior-guided training of retrievers for improved open-ended generation. ICLR 2022 - [c86]Lingjiao Chen, Matei Zaharia, James Zou:
Efficient Online ML API Selection for Multi-Label Classification Tasks. ICML 2022: 3716-3746 - [c85]Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia:
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. NAACL-HLT 2022: 3715-3734 - [c84]Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Christopher Ré, Matei Zaharia, James Y. Zou:
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions. NeurIPS 2022 - [c83]Lingjiao Chen, Matei Zaharia, James Y. Zou:
Estimating and Explaining Model Performance When Both Covariates and Labels Shift. NeurIPS 2022 - [c82]Peter Kraft, Fiodar Kazhamiaka, Peter Bailis, Matei Zaharia:
Data-Parallel Actors: A Programming Model for Scalable Query Serving Systems. NSDI 2022: 1059-1074 - [c81]Daniel Kang
, Nikos Aréchiga, Sudeep Pillai, Peter D. Bailis, Matei Zaharia
:
Finding Label and Model Errors in Perception Data With Learned Observation Assertions. SIGMOD Conference 2022: 496-505 - [c80]Daniel Kang
, John Guibas, Peter D. Bailis, Tatsunori Hashimoto, Matei Zaharia
:
TASTI: Semantic Indexes for Machine Learning-based Queries over Unstructured Data. SIGMOD Conference 2022: 1934-1947 - [c79]Alexander Behm, Shoumik Palkar, Utkarsh Agarwal, Timothy Armstrong, David Cashman, Ankur Dave, Todd Greenstein, Shant Hovsepian, Ryan Johnson, Arvind Sai Krishnan, Paul Leventis, Ala Luszczak, Prashanth Menon, Mostafa Mokhtar, Gene Pang, Sameer Paranjpye, Greg Rahn, Bart Samwel, Tom van Bussel, Herman Van Hovell, Maryann Xue, Reynold Xin, Matei Zaharia
:
Photon: A Fast Query Engine for Lakehouse Systems. SIGMOD Conference 2022: 2326-2339 - [i62]Daniel Kang, Nikos Aréchiga, Sudeep Pillai, Peter Bailis, Matei Zaharia:
Finding Label and Model Errors in Perception Data With Learned Observation Assertions. CoRR abs/2201.05797 (2022) - [i61]Gina Yuan, David Mazières, Matei Zaharia:
Extricating IoT Devices from Vendor Infrastructure with Karl. CoRR abs/2204.13737 (2022) - [i60]Keshav Santhanam, Omar Khattab, Christopher Potts, Matei Zaharia
:
PLAID: An Efficient Engine for Late Interaction Retrieval. CoRR abs/2205.09707 (2022) - [i59]Peter Kraft, Qian Li, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Danny Cho, Jason Li, Robert Redmond, Nathan W. Weckwerth, Brian S. Xia, Peter Bailis, Michael J. Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia:
Apiary: A DBMS-Backed Transactional Function-as-a-Service Framework. CoRR abs/2208.13068 (2022) - [i58]Lingjiao Chen, Matei Zaharia
, James Zou:
Estimating and Explaining Model Performance When Both Covariates and Labels Shift. CoRR abs/2209.08436 (2022) - [i57]Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Christopher Ré, Matei Zaharia
, James Zou:
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions. CoRR abs/2209.08443 (2022) - [i56]Trevor Gale, Deepak Narayanan, Cliff Young, Matei Zaharia
:
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts. CoRR abs/2211.15841 (2022) - [i55]Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md. Arafat Sultan, Salim Roukos, Matei Zaharia
, Christopher Potts:
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking. CoRR abs/2212.01340 (2022) - [i54]Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia
:
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP. CoRR abs/2212.14024 (2022) - [i53]Qian Li, Peter Kraft, Michael J. Cafarella, Çagatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Matei Zaharia
:
Transactions Make Debugging Easy. CoRR abs/2212.14161 (2022) - 2021
- [j30]Daniel Kang
, John Guibas, Peter Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia
:
Accelerating Approximate Aggregation Queries with Expensive Predicates. Proc. VLDB Endow. 14(11): 2341-2354 (2021) - [j29]Matei Zaharia
:
Designing Production-Friendly Machine Learning. Proc. VLDB Endow. 14(13): 3420 (2021) - [j28]Athinagoras Skiadopoulos
, Qian Li, Peter Kraft, Kostis Kaffes, Daniel Hong, Shana Mathew, David Bestor, Michael J. Cafarella, Vijay Gadepally, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Tim Kraska, Michael Stonebraker, Lalith Suresh, Matei Zaharia
:
DBOS: A DBMS-oriented Operating System. Proc. VLDB Endow. 15(1): 21-30 (2021) - [j27]Omar Khattab, Christopher Potts, Matei Zaharia
:
Relevance-guided Supervision for OpenQA with ColBERT. Trans. Assoc. Comput. Linguistics 9: 929-944 (2021) - [j26]Firas Abuzaid
, Peter Kraft, Sahaana Suri, Edward Gan, Eric Xu, Atul Shenoy, Asvin Ananthanarayan, John Sheu, Erik Meijer, Xi Wu, Jeffrey F. Naughton, Peter Bailis, Matei Zaharia
:
DIFF: a relational interface for large-scale data explanation. VLDB J. 30(1): 45-70 (2021) - [c78]Fiodar Kazhamiaka, Matei Zaharia, Peter Bailis:
Challenges and Opportunities for Autonomous Vehicle Query Systems. CIDR 2021 - [c77]Matei Zaharia, Ali Ghodsi, Reynold Xin, Michael Armbrust:
Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics. CIDR 2021 - [c76]Pratiksha Thaker, Hudson Ayers, Deepti Raghavan, Ning Niu, Philip Alexander Levis, Matei Zaharia
:
Clamor: Extending Functional Cluster Computing Frameworks with Fine-Grained Remote Memory Access. SoCC 2021: 654-669 - [c75]Pratiksha Thaker, Matei Zaharia
, Tatsunori Hashimoto:
Don't Hate the Player, Hate the Game: Safety and Utility in Multi-Agent Congestion Control. HotNets 2021: 140-146 - [c74]Deepti Raghavan, Philip Alexander Levis, Matei Zaharia
, Irene Zhang:
Breakfast of champions: towards zero-copy serialization with NIC scatter-gather. HotOS 2021: 199-205 - [c73]Deepak Narayanan, Amar Phanishayee, Kaiyu Shi, Xie Chen, Matei Zaharia:
Memory-Efficient Pipeline-Parallel DNN Training. ICML 2021: 7937-7947 - [c72]Omar Khattab, Christopher Potts, Matei A. Zaharia:
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval. NeurIPS 2021: 27670-27682 - [c71]Firas Abuzaid, Srikanth Kandula, Behnaz Arzani, Ishai Menache, Matei Zaharia, Peter Bailis:
Contracting Wide-area Network Topologies to Solve Flow Problems Quickly. NSDI 2021: 175-200 - [c70]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
:
Efficient large-scale language model training on GPU clusters using megatron-LM. SC 2021: 58 - [c69]Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Akshay Agrawal, Srikanth Kandula, Stephen P. Boyd, Matei Zaharia
:
Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP. SOSP 2021: 521-537 - [c68]Saba Eskandarian, Henry Corrigan-Gibbs, Matei Zaharia, Dan Boneh:
Express: Lowering the Cost of Metadata-hiding Communication with Cryptographic Privacy. USENIX Security Symposium 2021: 1775-1792 - [i52]Omar Khattab, Christopher Potts, Matei Zaharia:
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval. CoRR abs/2101.00436 (2021) - [i51]Lingjiao Chen, Matei Zaharia, James Zou:
FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks. CoRR abs/2102.09127 (2021) - [i50]Akshay Agrawal, Stephen P. Boyd, Deepak Narayanan, Fiodar Kazhamiaka, Matei Zaharia:
Allocation of Fungible Resources via a Fast, Scalable Price Discovery Method. CoRR abs/2104.00282 (2021) - [i49]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia:
Efficient Large-Scale Language Model Training on GPU Clusters. CoRR abs/2104.04473 (2021) - [i48]Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Matei Zaharia:
Don't Give Up on Large Optimization Problems; POP Them! CoRR abs/2104.06513 (2021) - [i47]Daniel Kang, John Guibas, Peter Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia:
Proof: Accelerating Approximate Aggregation Queries with Expensive Predicates. CoRR abs/2107.12525 (2021) - [i46]Lingjiao Chen, Tracy Cai, Matei Zaharia, James Zou:
Did the Model Change? Efficiently Assessing Machine Learning API Shifts. CoRR abs/2107.14203 (2021) - [i45]Daniel Kang, John Guibas, Peter Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia:
Accelerating Approximate Aggregation Queries with Expensive Predicates. CoRR abs/2108.06313 (2021) - [i44]Ashwin Paranjape, Omar Khattab, Christopher Potts, Matei Zaharia, Christopher D. Manning:
Hindsight: Posterior-guided training of retrievers for improved open-ended generation. CoRR abs/2110.07752 (2021) - [i43]Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Akshay Agrawal, Srikanth Kandula, Stephen P. Boyd, Matei Zaharia:
Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP. CoRR abs/2110.11927 (2021) - [i42]Keshav Santhanam, Siddharth Krishna, Ryota Tomioka, Tim Harris, Matei Zaharia:
DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution. CoRR abs/2111.05426 (2021) - [i41]Yuezhou Sun, Wenlong Zhao, Lijun Zhang, Xiao Liu, Hui Guan, Matei Zaharia:
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression. CoRR abs/2111.10320 (2021) - [i40]Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia:
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. CoRR abs/2112.01488 (2021) - [i39]Neoklis Polyzotis, Matei Zaharia:
What can Data-Centric AI Learn from Data and ML Engineering? CoRR abs/2112.06439 (2021) - 2020
- [j25]Daniel Kang, Edward Gan, Peter Bailis, Tatsunori Hashimoto, Matei Zaharia:
Approximate Selection with Guarantees using Proxies. Proc. VLDB Endow. 13(11): 1990-2003 (2020) - [j24]