Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

arXiv · September 3, 2025 · Significant research

Summary

The researchers introduce KAU-CSSL, the first continuous Saudi Sign Language (SSL) dataset focusing on complete sentences. They propose a transformer-based model using ResNet-18 for spatial feature extraction and a Transformer Encoder with Bidirectional LSTM for temporal dependencies. The model achieved 99.02% accuracy in signer-dependent mode and 77.71% in signer-independent mode, advancing communication tools for the SSL community.

Keywords

Saudi Sign Language · SSL · KAU-CSSL · Transformer · ResNet-18

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.

Tomato Maturity Recognition with Convolutional Transformers

arXiv · Jul 4

This paper introduces a convolutional transformer model for classifying tomato maturity, along with a new UAE-sourced dataset, KUTomaData, for training segmentation and classification models. The model combines CNNs and transformers and was tested against two public datasets. Results showed state-of-the-art performance, outperforming existing methods by significant margins in mAP scores across all three datasets.

A Culturally-diverse Multilingual Multimodal Video Benchmark & Model

arXiv · Jun 8

A new benchmark, ViMUL-Bench, is introduced to evaluate video LLMs across 14 languages, including Arabic, with a focus on cultural inclusivity. The benchmark includes 8k manually verified samples across 15 categories and varying video durations. A multilingual video LLM, ViMUL, is also presented, along with a training set of 1.2 million samples, with both to be publicly released.

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

Summary

Keywords

Related

Tomato Maturity Recognition with Convolutional Transformers

A Culturally-diverse Multilingual Multimodal Video Benchmark & Model