GCC AI Research

Archive Monthly

August 2023

4 articles

Top Stories

Meet Jais, The World’s Most Advanced Arabic LLM - G42

Inception · · NLP LLM

G42's Core42 has released Jais, a new Arabic large language model. Jais includes 13 billion parameters and was trained on a dataset of 126B tokens, including 43B Arabic tokens. According to the developers, Jais achieves state-of-the-art results on Arabic benchmarks and competitive performance on English benchmarks. Why it matters: Jais represents a significant step forward for Arabic NLP, providing a powerful new tool for researchers and developers in the region.

PECon: Contrastive Pretraining to Enhance Feature Alignment between CT and EHR Data for Improved Pulmonary Embolism Diagnosis

arXiv · · Healthcare CV

This paper introduces Pulmonary Embolism Detection using Contrastive Learning (PECon), a supervised contrastive pretraining strategy using both CT scans and EHR data to improve feature alignment between modalities for better PE diagnosis. PECon pulls sample features of the same class together while pushing away features of other classes. The approach achieves state-of-the-art results on the RadFusion dataset, with an F1-score of 0.913 and AUROC of 0.943.