Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Khattat: Enhancing Readability and Concept Representation of Semantic Typography
 
conference paper

Khattat: Enhancing Readability and Concept Representation of Semantic Typography

Hussein, Ahmed
•
Elsetohy, Alaa
•
Hadhoud, Sama
Show more
DelBue, A
•
Canton, C
Show more
May 12, 2025
Computer Vision – ECCV 2024 Workshops Milan, Italy, September 29–October 4, 2024, Proceedings
18th European Conference on Computer Vision

Designing expressive typography that visually conveys a word's meaning while maintaining readability is a complex task, known as semantic typography. It involves selecting an idea, choosing an appropriate font, and balancing creativity with legibility. We introduce an end-to-end system that automates this process. First, a Large Language Model (LLM) generates imagery ideas for the word, useful for abstract concepts like "freedom." Then, the FontCLIP pre-trained model automatically selects a suitable font based on its semantic understanding of font attributes. The system identifies optimal regions of the word for morphing and iteratively transforms them using a pre-trained diffusion model. A key feature is our OCR-based loss function, which enhances readability and enables simultaneous stylization of multiple characters. We compare our method with other baselines, demonstrating great readability enhancement and versatility across multiple languages and writing scripts.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-031-92808-6_18
Web of Science ID

WOS:001544980800014

Author(s)
Hussein, Ahmed

Egyptian Knowledge Bank (EKB)

Elsetohy, Alaa

Mohamed bin Zayed University of Artificial Intelligence MBZUAI

Hadhoud, Sama

Mohamed bin Zayed University of Artificial Intelligence MBZUAI

Bakr, Tameem

Mohamed bin Zayed University of Artificial Intelligence MBZUAI

Rohaim, Yasser

Egyptian Knowledge Bank (EKB)

AlKhamissi, Badr  

École Polytechnique Fédérale de Lausanne

Editors
DelBue, A
•
Canton, C
•
Pont-Tuset, J
•
Tommasi, T
Date Issued

2025-05-12

Publisher

Springer Nature

Publisher place

Cham

Published in
Computer Vision – ECCV 2024 Workshops Milan, Italy, September 29–October 4, 2024, Proceedings
ISBN of the book

978-3-031-92807-9

978-3-031-92808-6

Book part number

Part V

Series title/Series vol.

Lecture Notes in Computer Science; 15627

ISSN (of the series)

0302-9743

1611-3349

Start page

278

End page

295

Subjects

Semantic Typography

•

Multi-letter

•

Multilingual

•

OCR Loss

•

Large Language Models

•

Font Selection

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
NLP  
Event nameEvent acronymEvent placeEvent date
18th European Conference on Computer Vision

ECCV 2024

Milan, Italy

2024-09-29 - 2024-10-04

Available on Infoscience
September 19, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/254208
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés