The Symposium on Data Mining and Applications (SDMA 2014) was organized by MEGDAM to foster collaboration among data mining and machine learning researchers in Saudi Arabia, the GCC countries, and the Middle East. The symposium covered areas such as statistics, computational intelligence, pattern recognition, databases, big data mining, and visualization. Papers were accepted on the basis of originality, significance, and quality of contribution.
This paper focuses on analyzing surveys of women entrepreneurs in the UAE using machine learning techniques. The goal is to extract relevant insights from the data to understand the current landscape and predict future trends. The study aims to support better business decisions related to women in entrepreneurship.
This paper introduces the Saudi Privacy Policy Dataset, a collection of Arabic privacy policies from various sectors in Saudi Arabia. The dataset is annotated according to the 10 principles of the Personal Data Protection Law (PDPL) and covers 1,000 websites, 4,638 lines of text, and 775,370 tokens. It is intended to facilitate research and development in privacy policy analysis, NLP, and machine learning applications related to data protection.
This paper introduces a novel fuzzy clustering method for circular time series based on a new dependence measure that considers circular arcs. The algorithm groups series generated from similar stochastic processes and demonstrates computational efficiency. The method is applied to time series of wind direction in Saudi Arabia, showcasing its practical potential.
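To see why directional data such as wind direction need circular rather than linear treatment, consider the short Python sketch below. It is not the paper's dependence measure or clustering algorithm, neither of which is described in this summary. It shows only two standard circular-statistics building blocks, and the function names `circular_mean_deg` and `arc_distance_deg` are hypothetical names chosen for illustration.

```python
import math


def circular_mean_deg(angles):
    """Mean direction of angles given in degrees, returned in [0, 360].

    Each angle is treated as a unit vector; the mean direction is the
    direction of the vector sum. The result is not meaningful when the
    vectors nearly cancel out (for example, 0 and 180 degrees).
    """
    s = sum(math.sin(math.radians(a)) for a in angles)
    c = sum(math.cos(math.radians(a)) for a in angles)
    return math.degrees(math.atan2(s, c)) % 360.0


def arc_distance_deg(a, b):
    """Length in degrees of the shorter arc between two directions."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


# Two wind directions just either side of north (illustrative values only).
directions = [350.0, 10.0]

linear_mean = sum(directions) / len(directions)   # points south, which is wrong
circular_mean = circular_mean_deg(directions)     # points north, up to rounding

print("linear mean:", linear_mean)
print("circular mean is within 1e-6 degrees of north:",
      arc_distance_deg(circular_mean, 0.0) < 1e-6)
print("arc distance between 350 and 10:", arc_distance_deg(350.0, 10.0))
```

The arithmetic mean of 350° and 10° is 180°, the opposite of the true prevailing direction, which is why any similarity measure for circular series has to work with arcs on the circle rather than differences on the number line.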
This study investigates the correlation between Google Trends data for COVID-19 symptoms and the actual number of COVID-19 cases in Saudi Arabia between March and October 2020. The researchers found that searches for "cough" and "sore throat" were most frequent, while "loss of smell", "loss of taste", and "diarrhea" showed the highest correlation with confirmed cases. The study concludes that Google searches can serve as a supplementary surveillance tool for monitoring the spread of COVID-19 in Saudi Arabia. Why it matters: The research demonstrates the potential of using readily available digital data to augment traditional surveillance methods for public health monitoring in the region.
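The kind of comparison described above can be sketched in a few lines of Python. This is a minimal illustration that assumes Pearson's coefficient; the summary does not state which correlation statistic the study used. The two series are made-up numbers, not data from the study, and the names `pearson`, `search_interest`, and `confirmed_cases` are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-deviations and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Made-up weekly values, for illustration only (not data from the study).
search_interest = [12, 18, 25, 40, 38, 30]         # relative search volume
confirmed_cases = [100, 160, 240, 410, 390, 300]   # new cases per week

r = pearson(search_interest, confirmed_cases)
print(f"r = {r:.3f}")
```

A coefficient near 1 means the two series rise and fall together. It does not by itself show that searches predict cases, which is consistent with the study treating search data as a supplementary signal rather than a replacement for case reporting.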