arxiv:2305.11364

Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models

Published on May 19, 2023
· Submitted by akhaliq on May 22, 2023

Abstract

Large language models (LLMs) can be used to generate smaller, more refined datasets via few-shot prompting for benchmarking, fine-tuning, or other use cases. However, understanding and evaluating these datasets is difficult, and the failure modes of LLM-generated data are still not well understood. Specifically, the data can be repetitive in surprising ways, not only semantically but also syntactically and lexically. We present LinguisticLens, a novel interactive visualization tool for making sense of and analyzing syntactic diversity of LLM-generated datasets. LinguisticLens clusters text along syntactic, lexical, and semantic axes. It supports hierarchical visualization of a text dataset, allowing users to quickly scan for an overview and inspect individual examples. The live demo is available at shorturl.at/zHOUV.
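
To make the idea of lexical repetition concrete, here is a minimal sketch of grouping generated examples by word overlap. It is not the LinguisticLens implementation, which the abstract does not describe at this level. The Jaccard measure, the greedy clustering, the 0.5 threshold, the function names, and the sample sentences are all assumptions chosen for illustration.

```python
import re


def tokens(text: str) -> set[str]:
    """Return the lowercased word tokens of a text as a set."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity of two token sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_by_lexical_overlap(
    examples: list[str], threshold: float = 0.5
) -> list[list[str]]:
    """Greedy single-pass clustering (an illustrative choice).

    Each example joins the first cluster whose first member it overlaps
    with at or above `threshold`; otherwise it starts a new cluster.
    """
    clusters: list[tuple[set[str], list[str]]] = []
    for text in examples:
        toks = tokens(text)
        for representative, members in clusters:
            if jaccard(toks, representative) >= threshold:
                members.append(text)
                break
        else:
            # No existing cluster was close enough.
            clusters.append((toks, [text]))
    return [members for _, members in clusters]


# Hypothetical LLM-generated examples, invented for this sketch.
examples = [
    "Can you play some jazz music?",
    "Can you play some rock music?",
    "What is the weather in Paris today?",
    "Can you play some pop music?",
]
clusters = cluster_by_lexical_overlap(examples)
for members in clusters:
    print(len(members), members)
```

A large cluster of near-identical templates, such as the three "Can you play some ... music?" examples here, is the kind of lexical repetition the abstract warns about. Grouping along syntactic or semantic axes would need a different representation, for example part-of-speech sequences or sentence embeddings.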
