Research Highlights: ExBERT


In the insideBIGDATA Research Highlights column we take a look at new and upcoming results from the research community for data science, machine learning, AI and deep learning. Our readers need a glimpse of the technology coming down the pipeline that will make their efforts more strategic and competitive. In this installment we review a new paper: exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models by researchers from the MIT-IBM Watson AI Lab and Harvard. The group presents the latest on exBERT, a tool that lets you go under the hood of a language model and gather previously inaccessible details, such as what information the model uses to autocomplete words and phrases.

Large Transformer-based language models can route and reshape complex information via their multi-headed attention mechanism. Although the attention never receives explicit supervision, it can exhibit understandable patterns that track linguistic or positional information. To further our understanding of the inner workings of these models, we need to analyze both the learned representations and the attention.
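To make the mechanism concrete, here is a minimal NumPy sketch of multi-headed scaled dot-product attention. This is not the paper's code; the dimensions, weight matrices, and function names are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # shift for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(X, Wq, Wk, Wv, Wo, n_heads):
    """Multi-headed scaled dot-product attention over one sequence.

    X: (seq_len, d_model) token representations
    Wq, Wk, Wv, Wo: (d_model, d_model) learned projections
    Returns the new representations plus per-head attention weights.
    """
    seq_len, d_model = X.shape
    d_head = d_model // n_heads

    # Project the inputs, then split the feature dimension into heads:
    # (seq_len, d_model) -> (n_heads, seq_len, d_head)
    def project(W):
        return (X @ W).reshape(seq_len, n_heads, d_head).transpose(1, 0, 2)

    Q, K, V = project(Wq), project(Wk), project(Wv)

    # Each head forms its own attention distribution over positions;
    # these per-head weights are the patterns tools like exBERT visualize.
    scores = Q @ K.transpose(0, 2, 1) / np.sqrt(d_head)   # (heads, seq, seq)
    attn = softmax(scores, axis=-1)

    # Weighted sum of values, merge the heads, project back to d_model.
    merged = (attn @ V).transpose(1, 0, 2).reshape(seq_len, d_model)
    return merged @ Wo, attn

# Toy usage: 4 tokens, model width 8, 2 heads; random weights stand in
# for the parameters a trained Transformer would have learned.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv, Wo = (rng.normal(size=(8, 8)) for _ in range(4))
out, attn = multi_head_attention(X, Wq, Wk, Wv, Wo, n_heads=2)
print(out.shape, attn.shape)  # (4, 8) (2, 4, 4)
```

Because the attention weights are an explicit intermediate product rather than an opaque activation, they can be inspected per head and per layer, which is what makes the kind of visual analysis exBERT performs possible.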

To support analysis of a wide variety of Transformer models, the researchers introduce exBERT, a tool that helps humans conduct flexible, interactive investigations and formulate hypotheses about a model's internal reasoning process. exBERT provides insight into the meaning of the contextual representations and attention by matching a human-specified input to similar contexts in large annotated datasets.
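exBERT itself matches against large pre-annotated corpora, but the core idea, comparing a token's contextual embedding to reference embeddings, can be sketched with the Hugging Face transformers library. The two-sentence stand-in corpus, the query sentence, and the cosine-similarity ranking below are illustrative assumptions, not the tool's implementation.

```python
import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
# output_attentions also exposes the per-layer, per-head attention
# weights that exBERT renders alongside the representations.
model = AutoModel.from_pretrained(
    "bert-base-uncased", output_hidden_states=True, output_attentions=True
)
model.eval()

@torch.no_grad()
def token_embeddings(text, layer=-1):
    """Return the tokens and their contextual embeddings at one layer."""
    enc = tok(text, return_tensors="pt")
    out = model(**enc)
    tokens = tok.convert_ids_to_tokens(enc["input_ids"][0].tolist())
    return tokens, out.hidden_states[layer][0]  # (seq_len, hidden)

# A tiny stand-in "reference corpus" (exBERT uses large annotated ones).
corpus = [
    "The bank raised interest rates.",
    "She sat on the river bank.",
]
ref_tokens, ref_vecs = [], []
for sent in corpus:
    toks, vecs = token_embeddings(sent)
    ref_tokens += [(sent, t) for t in toks]
    ref_vecs.append(vecs)
ref_vecs = torch.cat(ref_vecs)

# Query: which corpus tokens are used most like "bank" is used here?
q_tokens, q_vecs = token_embeddings("He deposited cash at the bank.")
q = q_vecs[q_tokens.index("bank")]

sims = torch.nn.functional.cosine_similarity(q[None, :], ref_vecs)
for i in sims.topk(3).indices:
    sent, t = ref_tokens[i]
    print(f"{sims[i]:.3f}  {t!r} in {sent!r}")
```

On this toy corpus the financial use of "bank" would typically score higher than the river-bank use, which illustrates the kind of contextual evidence exBERT surfaces interactively rather than through ad hoc scripts.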

The fully featured demo video below shows select Transformer models applied to the Wizard of Oz and a subset of Wikipedia, both pre-annotated with the hidden representations of each model.

