Singapore ACM SIGKDD Symposium 2024

On 31 July 2024, the Singapore Chapter of SIGKDD organized a full-day symposium. It was a rich program that among other things featured presentations of papers by authors based in Singapore that had been accepted in recent conferences such as KDD-24, SIGIR-24, ICML-24, IJCAI-24, WWW-24, ACL-24, NeurIPS-23, etc. The event was attended by more than 50 participants from various universities and companies.

Hady Lauw (Chapter Chair) opened the session and introduced the chapter and the day’s schedule

The day featured three invited talks on various topics. The first morning talk was given by Zhenhua Dong, who gave us comprehensive perspectives on biases in recommender system, covering both out of distribution and unfairness. The second morning talk was given by Wenxuan Zhang, who took us through the several interesting works in advancing multilingual large language models. The afternoon talk was given by Shonali Krishnaswamy, who gave a captivating sharing of her successful AI startup journey.

Invited speakers Zhenhua Dong and Wenxuan Zhang providing useful insights to the audience

For the first time, we ran the presentations along two tracks: Research Track, managed by Yuan Fang and Cheng Long, as well as Applied Data Science Track, managed by Yong Liu and Wee Siong Ng. The research track featured seven oral presentations and eleven posters, while the applied data science track featured two oral presentations.

The oral sessions showcased recently published papers by Singapore-based authors

At the end of the day, we announced several awards based on audience feedback, but the reality was that all attendees won, because we all gained insights and friendships.

Unlocking Personalization in Large-Scale Recommendations

Date: September 22, 2023

Time: 4pm to 5pm

Venue: SMU School of Computing and Information Systems 2, Seminar Room B1.2

Abstract

In this talk, we delve into the fascinating world of large-scale recommender systems, exploring three key dimensions: Conformity, Exploration and Context Awareness. We present innovative approaches to break free from conformity bias, infuse exploration to capture evolving user interests and incorporate context awareness for more personalized recommendations. Through rigorous experimentation and real-world deployments on platforms like Facebook Watch, we unveil how these methodologies revolutionize recommendation systems. Join us to uncover the secrets behind unlocking personalization in large-scale recommendations, offering profound insights that bridge the gap between user engagement and system performance.

Speakers

  • Khushhall Chandra Mahajan is a Senior Machine Learning Engineer and Researcher working in recommendation systems. Presently, he is an integral part of the Video Recommendations team at Meta, where he plays a pivotal role in crafting cutting-edge ML algorithms. These algorithms directly enhance the accuracy and impact of video recommendations used by billions of users worldwide. His work powers the recommendation engine for Facebook Watch and Reels, reaching over 1.25 billion users monthly. Prior to this role, he was in the Ads team, developing innovative Ads solutions that drove top-line revenue for the company. He has published several research papers in the domain of machine learning and recommendation systems. His research interest focuses on exploration & diversity in recommendation system and realtime ranking. Additionally he has served on the program committee of various top-tier international machine learning conferences such as NeurIPS, ICLR, ECIR, AISTATS, etc and organizer of the VideoRecsys workshop in ACM RecSys conference. As a fascinating side note, his innovation extends beyond machine learning; he has also developed the Swarachakra Bangla Android keyboard, which has garnered over a million downloads.
  • Amey Porobo Dharwadker works as a Machine Learning Engineering Manager at Meta, where he leads the Facebook Video Recommendations Core Ranking team responsible for developing personalization models used by billions of users around the world. His work has been instrumental in driving the remarkable growth in active users for Facebook, by powering the success of Facebook Watch and Reels, reaching more than 1.25 billion monthly users. Prior to that, he delivered significant user engagement and revenue growth for Facebook through advancements in News Feed and Ads Machine Learning. He has published several research papers in the fields of large-scale recommendation systems and machine learning. He also actively participates as a program committee member for top-tier AI venues including AISTATS, AAAI, IJCAI, ECIR, CIKM, etc. and organizes the VideoRecsys workshop at ACM RecSys conference. He also serves on the juries of renowned global technology competitions, including the Edison Awards and Globee Information Technology Awards.

Photos

Thank you Amey and Khushhall for taking the time to share your work on recommender systems with the SIGKDD community in Singapore.

Singapore ACM SIGKDD Symposium 2023

On 19 July 2023, the Singapore Chapter of SIGKDD organized a full-day symposium. It was a rich program that among other things featured presentations of papers by authors based in Singapore that had been accepted in recent conferences such as KDD-23, SIGIR-23, ACL-23, IJCAI-23, etc. It was a time for learning as well as networking and fellowship, attracting 70 participants with significant representations from both industry and academia. The event was sponsored by Huawei, as well as SMU School of Computing and Information Systems.

Below is a brief photo-story of the event.

Hady Lauw (SIGKDD Chapter Chair & Associate Professor, Singapore Management University) opened the session by introducing the Singapore ACM SIGKDD Chapter.
Rui Zhang (Principal Researcher at Huawei and Visiting Professor at Tsinghua University) discussed the opportunities and challenges of large language model-based information retrieval.
Fabian Suchanek (Professor at Télécom Paris University/ Institut Polytechnique de Paris in France) presented a well-received invited talk on ontologies, engaging the attendees throughout and concluding to a rousing round of applause.
The afternoon featured a wide-ranging discussion on data science for smart nation, with a diverse panel, featuring (from left) See-Kiong Ng (Professor at NUS), Jing Jiang (Professor at SMU), Jinmiao Chen (Principal Investigator at A*STAR), and Zhongwen Huang (Director of Smart City Projects Office).
The programme featured 9 papers as oral presentations, with representations not only from various conferences, but also from the local universities including NUS, NTU, SMU, and SUTD.
Another 10 papers were presented as digital posters, projected on a 49″ monitor. In addition, there were 3 posters that were work-in-progress, and we hope that some of them would eventually appear in some future KDD conference.
Importantly, our objective was also to gather people for networking and fellowship. In science, as in other endeavours, it’s important not just what you know, but also whom you know, for collaborations and ideas keep flowing.
Any gathering in Singapore would not be complete without food. Wow, there was food galore. Some even commented that the food might be better than some conferences where the presented papers were originally accepted.

We’re certainly excited to do this all over again. Keep a look out for our next event!

Let’s Get Connected with KDD.SG Seminar on Graphs

In this seminar by KDD.SG held on April 26, 2023, we featured three experts on graphs.

Talk #1: Title: Efficient Graph Representations for Dynamic Processing on GPUs: Leveraging Dynamic Data Structures for Adaptability and Versatility

Abstract: Dynamic graph processing has become increasingly important in various fields, such as social network analysis, disease surveillance, and transaction monitoring, where graphs evolve over time or need to be analyzed in real-time. Graphics Processing Units (GPUs) offer massive parallelism and high computational power, making them an ideal choice for processing large-scale dynamic graphs. However, efficient processing of these dynamic graphs on GPUs demands adaptive and versatile graph representations. In this talk, we delve into the design of innovative graph representations that enable efficient dynamic graph processing on GPUs. We will discuss the characteristics of our graph representation, which utilizes dynamic data structures to cater to different update requirements. By highlighting the importance of adaptability and versatility in graph representations, attendees will gain valuable insights into optimizing dynamic graph processing on GPUs and be better equipped to enhance their own graph-based applications.

Bio: Yuchen Li is an assistant professor with the School of Computing and Information Systems, Singapore Management University (SMU). He received double BSc degrees in applied math and computer science (both with first-class honors) and a Ph.D. degree in computer science from the National University of Singapore (NUS), in 2013 and 2016, respectively.

Talk #2: Dense subgraph mining: problems, algorithms and applications

Abstract: A graph represents a set entities as vertices and the relations among the entities as edges and is used for modelling data in many applications. A subgraph of vertices in a graph, which are connected via many edges, is called a dense subgraph and often conveys meaningful knowledge of the graph. Finding dense subgraphs in a graph has a wide range of applications including correlation mining, fraud detection, e-commerce, bioinformatics, frequent pattern mining, and community detection, etc. In this talk, I will first introduce some representative applications, density measures and problems of dense subgraph mining. I will then present several of our recent studies of finding dense subgraphs in bipartite graphs and uncertain graphs and their applications in fraud detection, community search, and fake news detection. Finally, I will discuss some potential future work directions.

Bio: LONG Cheng is currently an Assistant Professor at the School of Computer Science and Engineering (SCSE), Nanyang Technological University (NTU). He got the PhD degree from Hong Kong University of Science and Technology (HKUST) in 2015. His research interests are broadly in data management and data mining.

Talk #3 Low-Resource Learning on Graphs
Abstract: Graph structures are ubiquitous in various domains, ranging from social networks and e-commerce platforms to transportation and biological systems. On these graphs, various graph-based analytics and mining tasks exist, many of which can be cast as instances of link prediction, node classification, and graph classification. Moving away from manual feature engineering, graph neural networks (GNN) have witnessed widespread success in various application scenarios due to their ability to learn powerful graph representations automatically. However, their success is often dependent on the availability and quality of graph structures and labeled data, without which their performance can suffer. In this talk, we explore alternative learning paradigms different from the traditional supervised learning paradigm, specifically addressing two types of low-resource scenarios on graphs: structure scarcity and label scarcity. We will first provide an overview of low-resource problems and methods on graphs, and then introduce some of our representative works on these problems.

Bio: Dr. Yuan Fang is an Assistant Professor at the School of Computing and Information Systems at Singapore Management University (SMU). He was previously a data scientist at DBS Bank and a research scientist at A*STAR. His research interests revolve around graph-based learning and its applications in recommender systems, social network analysis, and bioinformatics.

KDD for a Safer Online Social Space

On January 26, 2022, SIGKDD Singapore held an online lunch-time seminar featuring two talks on KDD for a Safer Online Social Space by two experts in the field.


Talk#1: Perils and Promises of Automated Hate Speech Detection

Abstract: Online hate speech is an important issue that breaks the cohesiveness of online social communities and even raises public safety concerns in our societies. Motivated by this rising issue, researchers have developed many traditional machine learning and deep learning methods to detect hate speech in online social platforms automatically. This talk aims to introduce the pressing problem of online hate speeches and discuss the automated hate speech detection methods. Specifically, we will discuss the various NLP approaches for hate speech detection and highlight the potential future research directions such as multimodal and multilingual hate speech detection.

Bios: Roy is an Assistant Professor at the Information Systems Technology and Design Pillar, Singapore University of Technology and Design. He is a faculty of the transformative Design and Artificial Intelligence program. His research lies in the intersection of data mining, machine learning, social computing, and natural language processing. He is leading the Social AI Studio, a research group that focuses on designing the next-generation social artificial intelligence systems. He has published in top-tier venues in data mining and computation linguistics domains. He serves in the program committees of multiple top conferences. He is currently part of the editorial board for the Social Network Analysis and Mining journal.


Talk#2: Rumor Detection with Generative Adversarial Learning

Abstract: Online rumors can cause devastating outcomes to individuals and society. Analysis shows that the widespread of rumors typically results from deliberate promotion of uncredited information aiming to shape the public opinions. On the other hand, fact-checking currently follows investigative journalism requiring significant amount of time and manual effort, which cannot keep the pace of generation of various rumors on a daily basis. In this talk, I will introduce automatic approaches, techniques and new development for combating online rumors in social media from the perspective of natural language processing and adversarial learning. I will also discuss the characterization of online rumors including their linguistic, temporal and propagational features and dynamics, together with some takeaways from past and ongoing research.

Bio: Wei Gao is currently an Assistant Professor of Computer Science at Singapore Management University. His research generally interests natural language processing, information retrieval, artificial intelligence and social computing. Currently he has been working on the topic of rumor detection and computational fact checking. His publications appear in the major international venues including ACL, AAAI/IJCAI, SIGIR, WSDM, WWW, ACM TOIS, IEEE TKDE, etc. He broadly serves the top conferences and leading journals in his relevant field. He is an Associate Editor of ACM TALLIP and a member of standing review committees of Transactions of the ACL and Computational Linguistics Journal.