Category: Sector: Inclusion

Hard Questions: Inclusion

  • IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding

    IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding

    Known by more than 1.5 billion people in the Indian subcontinent, Indic languages present unique challenges and opportunities for natural language processing (NLP) research due to their rich cultural heritage, linguistic diversity, and complex structures. IndicMMLU-Pro is a comprehensive benchmark designed to evaluate Large Language Models (LLMs) across Indic languages, building upon the MMLU Pro (Massive Multitask Language Understanding) framework. Covering major languages such as Hindi, Bengali, Gujarati, Marathi, Kannada, Punjabi, Tamil, Telugu, and Urdu, our benchmark addresses the unique challenges and opportunities presented by the linguistic diversity of the Indian subcontinent. This benchmark encompasses a wide range of tasks in language comprehension, reasoning, and generation, meticulously crafted to capture the intricacies of Indian languages. IndicMMLU-Pro provides a standardized evaluation framework to push the research boundaries in Indic language AI, facilitating the development of more accurate, efficient, and culturally sensitive models. This paper outlines the benchmarks’ design principles, task taxonomy, and data collection methodology, and presents baseline results from state-of-the-art multilingual models.

    More Information

  • Data Journalism Appropriation in African Newsrooms: A Comparative Study of Botswana and Namibia

    Data Journalism Appropriation in African Newsrooms: A Comparative Study of Botswana and Namibia

    Data journalism has received relatively limited academic attention in Southern Africa, with even less focus on smaller countries such as Botswana and Namibia. This article seeks to address this gap by exploring how selected newsrooms in these countries have engaged with data journalism, the ways it has enhanced their daily news reporting, and its impact on newsgathering and production routines. The study reveals varied patterns in the adoption of technology for data journalism across the two contexts. While certain skills remain underdeveloped, efforts to train journalists in data journalism have been evident. These findings support the argument that in emerging economies, the uneven adoption of data journalism technologies is influenced by exposure to these tools and practices.

    More Information

  • Towards regulating AI : A natural, labour and capital resources perspective

    Towards regulating AI : A natural, labour and capital resources perspective

    Policymakers who are looking at artificial intelligence (AI) applications are thinking about what we as a society want to achieve and what we need to protect, yet it is not commonly known that AI apps require intensive natural resources, labour and capital.

    More Information

  • Skills Lab Panel “Building Bridges”, for the Culture and Cohesion Summit.

    Skills Lab Panel “Building Bridges”, for the Culture and Cohesion Summit.

    Join Victoria Kuketz for an intercultural Skills Lab Panel “Building Bridges”, for the Culture and Cohesion Summit.

    More Information

  • ‘We Condemn the Sin, not the Sinner’: Understanding the Attitudes of Adventist and Baptist Believers in Montenegro towards Sexuality

    ‘We Condemn the Sin, not the Sinner’: Understanding the Attitudes of Adventist and Baptist Believers in Montenegro towards Sexuality

    Although we have seen studies on religion and sexuality in the West come to dominate the relevant scholarship, the postsocialist environment with a predominantly Orthodox population has received less attention. This paper fills the gap by examining the attitudes of Adventist and Baptist believers in Montenegro, with a particular focus on abortion, premarital sexual activities and non-heteronormative sexuality. Because we were dealing with a minority population, we opted for a qualitative methodology and conducted semistructured interviews with 17 Adventist and 15 Baptist believers. As observed, our research participants’ position towards sexuality is closely related to the teachings found in the Bible. However, it is possible to observe certain differences between the two groups; for example, the interviewed Adventists tend to be more conservative than the interviewed Baptists. In addition, the male Adventists seem slightly more liberal than female Adventists, whereas the situation is different among Baptists, where women come across as more liberal than men.

    More Information

  • Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems

    Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems

    Artificial Intelligence (AI) has paved the way for revolutionary decision-making processes, which if harnessed appropriately, can contribute to advancements in various sectors, from healthcare to economics. However, its black box nature presents significant ethical challenges related to bias and transparency. AI applications are hugely impacted by biases, presenting inconsistent and unreliable findings, leading to significant costs and consequences, highlighting and perpetuating inequalities and unequal access to resources. Hence, developing safe, reliable, ethical, and Trustworthy AI systems is essential. Our team of researchers working with Trustworthy and Responsible AI, part of the Transdisciplinary Scholarship Initiative within the University of Calgary, conducts research on Trustworthy and Responsible AI, including fairness, bias mitigation, reproducibility, generalization, interpretability, and authenticity. In this paper, we review and discuss the intricacies of AI biases, definitions, methods of detection and mitigation, and metrics for evaluating bias. We also discuss open challenges with regard to the trustworthiness and widespread application of AI across diverse domains of human-centric decision making, as well as guidelines to foster Responsible and Trustworthy AI models.

    More Information

  • Advancements in Modern Recommender Systems: Industrial Applications in Social Media, E-commerce, Entertainment, and Beyond

    Advancements in Modern Recommender Systems: Industrial Applications in Social Media, E-commerce, Entertainment, and Beyond

    In the current digital era, the proliferation of online content has overwhelmed users with vast amounts of information, necessitating effective filtering mechanisms. Recommender systems have become indispensable in addressing this challenge, tailoring content to individual preferences and significantly enhancing user experience. This paper delves into the latest advancements in recommender systems, analyzing 115 research papers and 10 articles, and dissecting their application across various domains such as e-commerce, entertainment, and social media. We categorize these systems into content-based, collaborative, and hybrid approaches, scrutinizing their methodologies and performance. Despite their transformative impact, recommender systems grapple with persistent issues like scalability, cold-start problems, and data sparsity. Our comprehensive review not only maps the current landscape of recommender system research but also identifies critical gaps and future directions. By offering a detailed analysis of datasets, simulation platforms, and evaluation metrics, we provide a robust foundation for developing next-generation recommender systems poised to deliver more accurate, efficient, and personalized user experiences, inspiring innovative solutions to drive forward the evolution of recommender technology.

    More Information

  • Unravelling socio-technological barriers to AI integration: A qualitative study of Southern African newsrooms

    Unravelling socio-technological barriers to AI integration: A qualitative study of Southern African newsrooms

    This study explores the socio-technological barriers to the adoption of artificial intelligence (AI)-powered solutions in three countries of the global south – South Africa, Lesotho, Eswatini, Botswana and Zimbabwe. Through 20 in-depth interviews with key stakeholders, it examines the distribution and circulation of AI technologies within selected newsrooms. Furthermore, the article explores socio-technological obstacles to the integration of AI among journalists. Lastly, it examines the consequences of these socio-technological obstacles to journalism. The article specifically seeks to answer three questions: How are AI technologies integrated in southern African newsrooms? What are the socio-technological barriers attendant to the use of AI in selected news organisations of sub-Saharan Africa? What are the implications of these socio-technological barriers to the process of news production in these newsrooms?

    More Information

  • OECD Gender Equality in Technology Governance

    OECD Gender Equality in Technology Governance

    Director Mackenzie represented the Aula Fellowship and the AI context in conversations at this global conference for equity. We stand together, or we fall.

    More Information

  • Decoding the Diversity: A Review of the Indic AI Research Landscape

    Decoding the Diversity: A Review of the Indic AI Research Landscape

    This review paper provides a comprehensive overview of large language model (LLM) research directions within Indic languages. Indic languages are those spoken in the Indian subcontinent, including India, Pakistan, Bangladesh, Sri Lanka, Nepal, and Bhutan, among others. These languages have a rich cultural and linguistic heritage and are spoken by over 1.5 billion people worldwide. With the tremendous market potential and growing demand for natural language processing (NLP) based applications in diverse languages, generative applications for Indic languages pose unique challenges and opportunities for research. Our paper deep dives into the recent advancements in Indic generative modeling, contributing with a taxonomy of research directions, tabulating 84 recent publications. Research directions surveyed in this paper include LLM development, fine-tuning existing LLMs, development of corpora, benchmarking and evaluation, as well as publications around specific techniques, tools, and applications. We found that researchers across the publications emphasize the challenges associated with limited data availability, lack of standardization, and the peculiar linguistic complexities of Indic languages. This work aims to serve as a valuable resource for researchers and practitioners working in the field of NLP, particularly those focused on Indic languages, and contributes to the development of more accurate and efficient LLM applications for these languages.

    More Information