Source Code Analysis and Natural Language Lab

SCANL is a diverse team of scientists dedicated to studying the latent connection between source code behavior and the natural language elements used to describe that behavior.

Interests

Program Comprehension and Textual Analysis.

There is a strong relationship between the natural language (e.g., found in identifiers) and behavior of source code; developers use this relationship to understand the code they read daily. We explore this relationship by studying rename refactorings, grammar patterns, and static source code analysis. Our goal is to support stronger techniques to automate identifier naming as well as support developers in reading and comprehending code more quickly. This is the research topic that underlies all other research we do. Please check our core research section to see what recent work we have done in this area.

Program Transformation and Refactoring

Program transformations allow us to modify code programmatically. It is important to ensure these techniques are safe, customizable, and easily integrated with today’s software development processes such that developers can, for example, migrate APIs or refactor. We support transformations both through our research on identifier naming and through the creation of flexible, easy-to-use techniques for creating and applying program transformations.

Static Source Code Analysis

A lot of our work relies on static analysis techniques, and most frequently we make use of the srcML Framework to normalize, transform, and analyze source code. Our lab supports several tools built on srcML in addition to hosting Dr. Emily Hill’s natural language framework, SWUM. We are dedicated to providing high-quality research tools and data sets for software research and development. Check our Github page regularly to see what we have to offer and feel free to contact us with questions.

Latest News

Last updated on Apr 27, 2025 1 min read

SCANL presenting at ICPC 2025!

SCANL will be presenting work at ICPC 2025 titled: SCALAR: A Part-of-speech Tagger for Identifiers. The tool we are presenting is SCALAR, a part-of-speech tagger for code. Click the link and check it out!

Last updated on Jan 25, 2024 1 min read

Congratulations to our graduated members!

Definitely late with this update, but Anthony Peruma and Reem Alsuhaibani both graduated and moved into their own faculty positions at University of Hawaii, and Prince Sultan University respectively. Congratulations to them both!

Last updated on May 21, 2022 2 min read

SCANL is presenting at ICSE, ICPC, NLBSE, and MSR

SCANL has several publications, posters, and talks going on at ICSE, ICPC, NLBSE, and MSR 2022!

Last updated on May 18, 2022 1 min read

Publication accepted in ICPC

“An Approach to Automatically Assess Method Names” was accepted for publication in the International Conference on Program Comprehension (ICPC)!

Last updated on Mar 1, 2022 1 min read

Publication accepted in NLBSE

“Understanding Digits in Identifier Names: An Exploratory Study was accepted for publication in the International Workshop on Natural Language-based Software Engineering (NLBSE)!

See all posts