Taking Stock of Stock Analysis

On 1 April 2019, we saw the power of machine learning techniques in building an automated stock investment system.  A/Prof Carol Hargreaves described a data science approach to identify top stocks, a trading system that not only she built with her expertise but also used by her personally.  We are grateful for the support from the host ING Bank, and for having Annerie Vreugdenhil (Head of Innovation, ING Wholesale Banking) open the event.

Screenshot 2019-04-02 at 10.55.29 AM.png

Agenda for the Evening

2019-04-03 18.20.32.jpg

Hady Lauw (Chapter Chair) outlined the agenda for the evening, as well as introduced the Singapore Chapter of ACM SIGKDD and our activities covering tutorials, seminars, and symposia to contribute to the data science community in Singapore

2019-04-03 18.20.35.jpg

Annerie Vreugdenhil (CIO of ING Wholesale Banking) shared about the innovation activities pursued by ING

 

2019-04-01-18.54.55.jpg

Carol Hargreaves (Associate Professor, NUS) described the broad arc of her approach in using multiple techniques such as PCA, regression and K-means for identifying top stocks

2019-04-01-19.02.20.jpg

Beyond the concepts, Carol showed a live demo of the techniques

2019-04-01-19.23.57-e1554174796670.jpg

The talk was engaging leading to many follow-up questions from the crowd which included domain experts and banking professionals, as well as data science enthusiasts

KDD.SG Seminar organized by SIGKDD Singapore & ING Bank

SIGKDD Singapore & ING Bank are jointly organizing this seminar delivered by A/Prof Carol Hargreaves, Director of the Data Analytics Consulting Centre at the National University of Singapore. The event will be opened by Annerie Vreugdenhil, Head of Innovation, ING Wholesale Banking.

Using Machine Learning Techniques to Identify Top Stocks

April 1st 2019 | 6.00pm to 8.00pm | ING Bank, Spark #12; 1 Wallich Street, Guoco Tower, 078881

Abstract

Interested in what goes into building an automated stock investment system? Using machine learning techniques of both unsupervised and supervised varieties, one can mine troves of data to discover statistical patterns towards identifying top stocks to trade. Demonstrated on a case study, we will see how the learned models perform upon paper trading, comparing the stock portfolio performance to that of the stock market index.

Speaker

photo_carolhargreavesA/Prof Carol Hargreaves is the Director of the Data Analytics Consulting Centre at the National University of Singapore. Prof Hargreaves has a joint position in the Department of Mathematics & the Department of Statistics & Applied Probability. Her role includes providing data analytics advisory & consulting services to industry, designing and teaching data analytics executive training courses for industry professionals, and is a noted keynote conference speaker and moderator.

Prof Hargreaves has a passion for solving business problems using analytics and machine learning techniques to build data driven solutions for growth and effective decision-making. She aims to provide her clients with organic revenue growth, allowing business processes to become smarter and faster while keeping customers engaged.

Prof Hargreaves is an analytics and business intelligence professional with over 30 years analytics experience, with leading roles in the pharmaceutical, healthcare & fast moving consumer goods and education industries. Prof Hargreaves has worked with a variety of leading companies to make businesses more intelligent. These include Pfizer, Novartis, Merck Sharp & Dohme, Nestle, MasterFoods, Goodman Fielder, Foxtel, Aztec (IRI), Cegedim Strategic Data (Quintiles), National Health & Medical Research Council, National University of Singapore.

Date/Time Monday 1 April 2019, beginning with light reception at 6.00pm

Venue ING Bank, Spark @ level 12, 1 Wallich Street, Guoco Tower, 078881

RSVP on our Meetup

See you there!

What Should A Data Scientist Know?

That’s the question that a group of data scientists from industry and academia discussed over dinner on 14 December 2018 at the NUSS Suntec City Guild House.  We were fortunate to have Marianne Winslett, a professor emerita of computer science at the University of Illinois at Urbana–Champaign, to open the event with a keynote address. We see the role of SIGKDD as contributing to identifying and building the skills required for data science professionals.  Though simple and direct answers to such a complex question are elusive, the discussions were fruitful and pointed us in the right direction. It was also a chance for us to reflect and regroup as a community.

IMG_6447.JPG

Marianne Winslett (UIUC) gave her take on the question, including the need for standards to ensure that professionals could build something that’d work, the need for a realistic mindset aware of the limitations of statistics, as well as the attention to ethical issues

IMG_6515.JPG

In between dinner courses, we had breakout discussions on the topic, moderated by Giuseppe Manai (Chapter Secretary)

IMG_6523.JPG

Though it was not by deliberate design, one table had primarily industry professionals, which brought forth issues on how to hire and select the right data scientists

IMG_6525.JPG

The other table just happened to have many academicians, with discussions touching on the skills and competencies of data scientists

IMG_6536.JPG

Hady Lauw (Chapter Chair) closed the event, summarizing the noted points for future follow-ups.

IMG_6592.JPG

A fruitful discussion over a satisfying dinner with a growing community.  From left: Cheng Long (NTU), Aixin Sun (NTU), Yuchen Li (SMU), Serene Ow (Grab), Graham Williams (Microsoft), Joao Gama (DataRobot), Giuseppe Manai (Chapter Secretary, ING FutureLabs Ventures), Xiaoli Li (I2R), Aloysius Lim (Chapter Membership Chair, Eureka AI), Huayu Wu (DBS), Bing Tian Dai (SMU), Jing Jiang (SMU), Marianne Winslett (UIUC), Hady Lauw (Chapter Chair, SMU)

 

 

Networking over Networks

On 18 Sep, we had a chance to learn about the power of networks from Dr. Gábor Benedek (Lynx Analytics), while networking with data science enthusiasts over pizza and beer. We are also grateful to WeWork for hosting our event.  Here are a few pictures to remember by.

kddtutorial3_intro.jpg

Giuseppe Manai (Chapter Secretary) introduced the speaker Gábor Benedek, PhD, who would be speaking about Big Graph Data Intelligence

kddtutorial3_gabor.jpg

In addition to graph theory and techniques, Gábor covered several case studies relating to social and health sciences as well

kddtutorial3_pizza

Networks gave us something to “chew on”, while pizza gave us something to chew while networking

kddtutorial3_wework2.jpg

WeWork provided a conducive space for learning together

 

KDD.SG Tutorial organized by SIGKDD Singapore & WeWork

Big Graph Data Intelligence – Analyzing Large Connected Data in Social and Health Science

September 18, 2018 | 6pm to 8.30pm | WeWork 71 Robinson

Abstract

The miracle behind Social Data is that we have information on the detailed structure of how people are connected to each other, who are the family members, who are the friends, and who are the influencers, who are the followers. We call this structure as graph topology. But beside the Social Data Topology (from diverse sources) we can also observe characteristics of and behavior mechanism among the individuals. Many researches have proved that inside micro-topologies (cliques or communities) people tend to think, decide, purchase or do similar things, have similar profiles in many cases. Thus, if we want to understand or change customers’ decision, we must use the micro-topology information, not just individual connections.

Today, multinational companies (banks, airlines, telecoms, insurance companies and many other domains) are closer than ever to analyze, understand and utilize Social Data. They usually have at least three different sources of Social Data, which is sufficient to build their own transactional Social Network. First, A-to-B transactions (calls, instant messages, money transfers, bookings). Second, (co-)locations and the (co-)movements (same address, sharing bills, traveling together). Third, digital behavior (browsing history, app usage) of customers, potentially complemented with external information. Just like Facebook helps not only to derive the Network, but add much diverse information on attributes and interests. This in turn enables further deep dive on the homogeneity of these communities, verify those cases when the network is similar in demography, when the network is giving an insight on commercial decisions, when the network enables the spread of word of mouth.

However, creating the transactional Network using these attributes is not easy, especially when there are dozens of millions of customers, and billions of possible and measurable interactions of and between them. The different sources of information can contain contradictions and confusions, when we observe that overlaid topologies are not matching. (E.g.: online friends and offline friends.) Lynx Analytics has experiences and solutions to build and use these networks efficiently.

Speaker

GaborBenedek.png

Gábor Benedek is an innovation partner at Lynx Analytics providing predictive analytics for communication companies, financial services and healthcare sectors in South-East-Asia. He has been applying SNA methodologies for Celcom, Indosat, Singtel, Telkomsel, Globe, DBS Bank in the region. Gábor received his PhD in 2003, in 2012 his T-Mobile SNA churn study was awarded as the Best Application paper by the European Decision Science Institute, and he is the author of one book and author/coauthor of over 20 articles. He was an Associate Professor at Corvinus University of Budapest, researching and lecturing in the fields of economic and business simulations, social network analysis, econometrics, data mining and predictive analytics. Gábor was among the founders of Data Explorer, the first consulting company in predictive analytics in Hungary. In 1999 Data Explorer built the first social network analysis software for churn and community detection applicable for mobile customers in Europe, based on Gábor’s theoretical foundations and proposals. In 2010 Gabor was contributing to the largest public physicians’ social network in the world, based on real patient-flow data between general practitioners and specialists.

Date/Time Tuesday 18 September 2018, beginning with a light reception at 6pm

Venue WeWork 71 Robinson Road Singapore 068895

RSVP on our Meetup.

See you there!