Making Data Communication Effective and Efficient in Computational Notebooks
PhD Thesis Proposal Defence

Title: "Making Data Communication Effective and Efficient in Computational Notebooks"

by

Miss Yanna LIN

Abstract:

In the era of big data, data science has become pivotal in extracting and generating insights from vast amounts of structured and unstructured data. Computational notebooks have emerged as essential tools in this field, integrating codes, outputs, and explanatory texts to create a computational narrative that enhances the exploration and communication of complex data insights. Despite their widespread adoption, significant challenges persist in ensuring effective data communication within these notebooks, particularly related to the quality of explanatory texts, the integration of texts, codes, and outputs, and the diverse needs of stakeholders.

This thesis addresses these challenges through the development of innovative solutions aimed at improving the efficiency and effectiveness of data communication within computational notebooks. In the first work, we introduced InkSight, a mixed-initiative plugin that automatically generates explanatory texts for chart outputs based on users' intents expressed through sketches. Recognizing that users still face difficulties in relating the explanatory texts to the corresponding charts and codes, we designed a second plugin, InterLink, to help clarify the relationships and cross-references between these elements. Beyond facilitating continued data exploration, some stakeholders prefer alternative formats of computational notebooks, such as data comics, to gain high-level insights and avoid the clutter of interim notes and findings. To address this, in the third work, we proposed DMiner, a data-driven framework that automates the layout and interaction designs of selected visualizations into data comics, meeting the diverse preferences of different audiences.

Finally, we discuss future research directions to further enhance the efficiency and effectiveness of data communication in computational notebooks.

Date: Friday, 7 June 2024
Time: 3:00pm - 5:00pm
Venue: Room 3494 Lifts 25/26

Committee Members:
Prof. Huamin Qu (Supervisor)
Prof. Cunsheng Ding (Chairperson)
Dr. Wei Zeng
Dr. Lionel Parreaux