Загрузка...

Robust and Calibrated Lightweight Transformers for Imbalanced Industrial Text Classification

#Transformers #LightweightModels #EfficientAI #ImbalancedData #TextClassification #NLP #NaturalLanguageProcessing #IndustrialAI #AIinIndustry #RobustAI #ModelCalibration #CalibratedModels #MachineLearning #DeepLearning #AIResearch #AppliedAI #DataImbalance #ClassImbalance #AIModels #ScalableAI #EdgeAI #ModelOptimization #EfficientNLP #TrustworthyAI #ExplainableAI #AIEngineering #ComputerScience #ResearchPaper #AcademicResearch #InnovationInAI

Robust and Calibrated Lightweight Transformers for Imbalanced Industrial Text Classification
Youssef Alothman 1 and Mohamed Bader-El-Den 2 , 1 University of Portsmouth, Portsmouth, UK, 2 Abdullah Al Salem University, Kuwait

Abstract

The semiconductor manufacturing sector produces enormous amounts of textual data that is highly imbalanced, non-stationary, and operationally critical. Although transformer-based language models achieve strong classification accuracy, their robustness and probability calibration under industrial constraints remain insufficiently addressed, particularly in resource-limited deployments. This paper proposes LiteFormer, a lightweight and calibrated transformer framework for imbalanced industrial text classification. The technique combines a geometry-informed minority over-sampling technique with D-SMOTE, imbalance-informed optimization with Focal Loss, and a post-hoc temperature scaling method. The technique outperforms standard transformer models on a large-scale industrial Root Cause Analysis data set, obtaining higher macro-F1 and significantly better Expected Calibration Error, while remaining computationally efficient. The technique performs robustly even when faced with temporal and domain shifts.

Keywords:Imbalanced text classification, lightweight transformers, probability calibration, focal loss, industrial NLP

Abstract URL :https://aircconline.com/csit/abstract/v16n08/csit160807.html

Article full text :https://aircconline.com/csit/papers/vol16/csit160810.pdf

Volume URL : https://airccse.org/csit/V16N08.html

Видео Robust and Calibrated Lightweight Transformers for Imbalanced Industrial Text Classification канала Computer Science & IT Conference Proceedings
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять