Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Fundamental Limits of Prompt Compression: A Rate- Distortion Framework for Black-Box Language Models
 
conference poster

Fundamental Limits of Prompt Compression: A Rate- Distortion Framework for Black-Box Language Models

Girish, Adway  
•
Nagle, Alliot
•
Bondaschi, Marco
Show more
September 25, 2024
Advances in Neural Information Processing Systems 37 (NeurIPS 2024)
38th Annual Conference on Neural Information Processing Systems

We formalize the problem of prompt compression for large language models (LLMs) and present a framework to unify token-level prompt compression methods which create hard prompts for black-box models. We derive the distortion-rate function for this setup as a linear program, and provide an efficient algorithm to compute this fundamental limit via the dual of the linear program. Using the distortion-rate function as the baseline, we study the performance of existing compression schemes on a synthetic dataset consisting of prompts generated from a Markov chain, natural language queries, and their respective answers. Our empirical analysis demonstrates the criticality of query-aware prompt compression, where the compressor has knowledge of the downstream task/query for the black-box LLM. We show that there is a large gap between the performance of current prompt compression methods and the optimal strategy, and propose Adaptive QuerySelect, a query-aware, variable-rate adaptation of a prior work to close the gap. We extend our experiments to a small natural language dataset to further confirm our findings on our synthetic dataset.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

7615_Fundamental_Limits_of_Pro.pdf

Type

Main Document

Version

http://purl.org/coar/version/c_970fb48d4fbd8a85

Access type

openaccess

License Condition

CC BY

Size

929.37 KB

Format

Adobe PDF

Checksum (MD5)

7d8859921cb162442bcaaf947d9051e0

Loading...
Thumbnail Image
Name

7615_Fundamental_Limits_of_Pro_Supplementary Material.zip

Access type

openaccess

Size

66.9 MB

Format

ZIP

Checksum (MD5)

c5cf3d70a662c29c3f7389efcb804bcc

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés