Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Reducing Annotation Efforts in Electricity Theft Detection Through Optimal Sample Selection
 
research article

Reducing Annotation Efforts in Electricity Theft Detection Through Optimal Sample Selection

Liao, Wenlong  
•
Bak-Jensen, Birgitte
•
Pillai, Jayakrishnan Radhakrishna
Show more
2024
IEEE Transactions on Instrumentation and Measurement

Supervised machine learning models are receiving increasing attention in electricity theft detection due to their high detection accuracy. However, their performance depends on a massive amount of labeled training data, which comes from time-consuming and resource-intensive annotations. To maximize model performance within a limited annotation budget, this article aims to reduce the annotation effort in electricity theft detection through optimal sample selection. In particular, a general framework and three new strategies are proposed to select the most valuable and representative samples from different perspectives, including uncertainty, class imbalance, and diversity of samples. In-depth simulations and analyses are conducted to evaluate the effectiveness of the proposed strategies on commonly used machine learning models and a real-world dataset. Simulation results show that the proposed strategies significantly outperform baselines on datasets of different sizes and fraudulent ratios. Besides, the proposed strategies are effective in improving detection performance across a range of classifiers.

  • Details
  • Metrics
Type
research article
DOI
10.1109/TIM.2024.3352696
Scopus ID

2-s2.0-85182917020

Author(s)
Liao, Wenlong  

École Polytechnique Fédérale de Lausanne

Bak-Jensen, Birgitte
Pillai, Jayakrishnan Radhakrishna
Xia, Xiaofang
Ruan, Guangchun
Yang, Zhe
Date Issued

2024

Published in
IEEE Transactions on Instrumentation and Measurement
Volume

73

Article Number

3508911

Start page

1

End page

11

Subjects

Data annotation

•

electricity theft

•

machine learning

•

sample selection

•

smart grid

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
WIRE  
Available on Infoscience
January 16, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/242854
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés