Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
 
conference paper

Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores

Petrescu, Diana  
•
Guirguis, Arsany  
•
Quoc, Do Le
Show more
November 20, 2024
SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
15th ACM Symposium on Cloud Computing

Storage disaggregation underlies today's cloud and is naturally complemented by pushing down some computation to storage, thus mitigating the potential network bottleneck between the storage and compute tiers. We show how ML training benefits from storage pushdowns by focusing on transfer learning (TL), the widespread technique that democratizes ML by reusing existing knowledge on related tasks. We propose HAPI, a new TL processing system centered around two complementary techniques that address challenges introduced by disaggregation. First, applications must carefully balance execution across tiers for performance. HAPI judiciously splits the TL computation during the feature extraction phase yielding pushdowns that not only improve network time but also improve total TL training time by overlapping the execution of consecutive training iterations across tiers. Second, operators want resource efficiency from the storage-side computational resources. HAPI employs storage-side batch size adaptation allowing increased storage-side pushdown concurrency without affecting training accuracy. HAPI yields up to 2.5× training speed-up while choosing in 86.8% of cases the best performing split point or one that is at most 5% off from the best.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3698038.3698549
Scopus ID

2-s2.0-85215537905

Author(s)
Petrescu, Diana  

École Polytechnique Fédérale de Lausanne

Guirguis, Arsany  

École Polytechnique Fédérale de Lausanne

Quoc, Do Le

Huawei Munich Research Center

Picorel, Javier

Huawei Munich Research Center

Guerraoui, Rachid  

EPFL

Dinu, Florin

Huawei Munich Research Center

Date Issued

2024-11-20

Publisher

Association for Computing Machinery, Inc

Publisher place

New York

Published in
SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
ISBN of the book

9798400712869

Start page

995

End page

1011

Subjects

Near-data processing

•

object stores

•

transfer learning

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
SACS  
DCL  
Event nameEvent acronymEvent placeEvent date
15th ACM Symposium on Cloud Computing

SoCC '24

Redmond, WA, USA

2024-11-20 - 2024-11-22

Available on Infoscience
February 10, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/246697
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés