Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
 
research article

Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints

Yan, Su
•
Bouaziz, Sofien  
•
Lee, Dongwon
Show more
2012
Neurocomputing

In this paper, we present a novel semi-supervised dimensionality reduction technique to address the problems of inefficient learning and costly computation in coping with high-dimensional data. Our method named the dual subspace projections (DSP) embeds high-dimensional data in an optimal low-dimensional space, which is learned with a few user-supplied constraints and the structure of input data. The method projects data into two different subspaces respectively the kernel space and the original input space. Each projection is designed to enforce one type of constraints and projections in the two subspaces interact with each other to satisfy constraints maximally and preserve the intrinsic data structure. Compared to existing techniques, our method has the following advantages: (1) it benefits from constraints even when only a few are available; (2) it is robust and free from overfitting; and (3) it handles nonlinearly separable data, but learns a linear data transformation. As a conclusion, our method can be easily generalized to new data points and is efficient in dealing with large datasets. An empirical study using real data validates our claims so that significant improvements in learning accuracy can be obtained after the DSP-based dimensionality reduction is applied to high-dimensional data. (C) 2011 Elsevier B.V. All rights reserved.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

thumbnail.png

Type

Thumbnail

Access type

openaccess

License Condition

copyright

Size

117.83 KB

Format

PNG

Checksum (MD5)

e9f6f3aa359dd123a2dd1b28113abeac

Loading...
Thumbnail Image
Name

DSP_NC12.pdf

Type

Preprint

Version

http://purl.org/coar/version/c_71e4c1898caa6e32

Access type

openaccess

License Condition

copyright

Size

377.43 KB

Format

Adobe PDF

Checksum (MD5)

455aca7cf933f4977c02e69cd9e0e326

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés