Skip to content
GCC AI Research

Search

Results for "handwriting recognition"

Transformers of the handwritten word

MBZUAI ·

MBZUAI researchers have developed an AI program using vision transformers that can learn a person's handwriting style and generate text in that style. The US Patent and Trademark Office recently granted a patent for this technology, which could aid individuals with writing impairments. The system overcomes limitations of previous GAN-based approaches by processing long-range dependencies in handwriting. Why it matters: This patented AI tool enhances personalized text generation and has potential applications in assistive technology and improving handwriting recognition models.

Window-Based Descriptors for Arabic Handwritten Alphabet Recognition: A Comparative Study on a Novel Dataset

arXiv ·

This paper introduces a novel dataset for Arabic handwritten isolated alphabet letters to serve as a benchmark for future research. The study presents a comparative evaluation of window-based descriptors for Arabic handwritten alphabet recognition, testing different descriptors with various classifiers. The experiments demonstrate that window-based descriptors perform well, especially when combined with a novel spatial pyramid partitioning scheme. Why it matters: The new dataset and analysis of descriptors will help advance Arabic OCR and handwritten text recognition systems.

AlexU-Word: A New Dataset for Isolated-Word Closed-Vocabulary Offline Arabic Handwriting Recognition

arXiv ·

Researchers from Alexandria University introduce AlexU-Word, a new dataset for offline Arabic handwriting recognition. The dataset contains 25,114 samples of 109 unique Arabic words, covering all letter shapes, collected from 907 writers. The dataset is designed for closed-vocabulary word recognition and to support segmented letter recognition-based systems. Why it matters: This dataset can help advance Arabic handwriting recognition systems, addressing a need for high-quality Arabic datasets in NLP research.

Biometric Recognition: How Do I Know Who You Are?

MBZUAI ·

A public talk announcement features Professor Anil K. Jain from Michigan State University discussing biometric recognition. The talk will cover automated recognition of individuals based on biological and behavioral traits. It will also address challenges, research opportunities, and ongoing projects in Jain's lab related to biometrics. Why it matters: As biometric technologies become increasingly integrated into daily life across the Middle East, understanding their limitations and ethical implications is crucial for responsible development and deployment.

Dates Fruit Disease Recognition using Machine Learning

arXiv ·

This paper proposes a machine learning method for early detection and classification of date fruit diseases, which are economically important to countries like Saudi Arabia. The method uses a hybrid feature extraction approach combining L*a*b color features, statistical features, and Discrete Wavelet Transform (DWT) texture features. Experiments using a dataset of 871 images achieved the highest average accuracy using Random Forest (RF), Multilayer Perceptron (MLP), Naïve Bayes (NB), and Fuzzy Decision Trees (FDT) classifiers.

A Missing and Found Recognition System for Hajj and Umrah

arXiv ·

A proposed recognition system aims to identify missing persons, deceased individuals, and lost objects during the Hajj and Umrah pilgrimages in Saudi Arabia. The system intends to leverage facial recognition and object identification to manage the large crowds expected in the coming decade, estimated to reach 20 million pilgrims. It will be integrated into the CrowdSensing system for crowd estimation, management, and safety.

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

arXiv ·

The researchers introduce KAU-CSSL, the first continuous Saudi Sign Language (SSL) dataset focusing on complete sentences. They propose a transformer-based model using ResNet-18 for spatial feature extraction and a Transformer Encoder with Bidirectional LSTM for temporal dependencies. The model achieved 99.02% accuracy in signer-dependent mode and 77.71% in signer-independent mode, advancing communication tools for the SSL community.