publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2024

  1. aug-llm-inference-xl.gif
    INFERCEPT: Efficient Intercept Support for Augmented Large Language Model Inference
    Reyna Abhyankar, Zijian He, Vikranth Srivatsa, and 2 more authors
    In Forty-first International Conference on Machine Learning, Jul 2024
  2. preble.gif
    Preble: Efficient Distributed Prompt Scheduling for LLM Serving
    Vikranth Srivatsa, Zijian He, Reyna Abhyankar, and 2 more authors
    arXiv preprint arXiv: 2407.00023, Jul 2024

2023

  1. cloudless_architecture.jpg
    Cloudless and Mixclaves
    Vikranth Srivatsa
    EECS Department, University of California, Berkeley, May 2023

2021

  1. waterbirds.png
    The Effect of Model Size on Worst-Group Generalization
    Alan Le Pham, Eunice Chan, Vikranth Srivatsa, and 6 more authors
    In NeurIPS 2021 Workshop on Distribution Shifts: Connecting Methods and Applications, May 2021