Knott, ManuelPerez-Cruz, FernandoDefraeye, Thijs2023-06-052023-06-052023-06-052023-01-1210.1016/j.jfoodeng.2022.111401https://infoscience.epfl.ch/handle/20.500.14299/197990WOS:000990499700001Image-based machine learning models can be used to make the sorting and grading of agricultural products more efficient. In many regions, implementing such systems can be difficult due to the lack of centralization and automation of postharvest supply chains. Stakeholders are often too small to specialize in machine learning, and large training data sets are unavailable. We propose a machine learning procedure for images based on pre-trained Vision Transformers. It is easier to implement than the current standard approach of training Convolutional Neural Networks (CNNs) as we do not (re-)train deep neural networks. We evaluate our approach based on two data sets for apple defect detection and banana ripeness estimation. Our model achieves a competitive classification accuracy equal to or less than one percent below the best-performing CNN. At the same time, it requires three times fewer training samples to achieve a 90% accuracy.Engineering, ChemicalFood Science & TechnologyEngineeringFood Science & Technologymachine learningcomputer visionfood qualitypostharvestclassificationFacilitated machine learning for image-based fruit quality assessmenttext::journal::journal article::research article