The Voyant Corpus
I analyzed two corpora in Voyant to better understand the word repetitions in each.
My goal was to understand how Voyant treats documents that share a common structure, compared with documents whose subjects or content vary from one to the next.
For instance, the First-Person Narratives of the American South, whose documents are more likely to differ from one another, may yield different findings from the U.S. President’s Inaugural Speeches, which typically cover fewer topics and are more likely to resemble one another structurally.
It’s crucial to remember that these observations and hypotheses are based on just two corpora. It’s all speculation, but I find it fascinating because it helps me better grasp how Voyant works as software.
The First-Person Narratives of the American South
The First-Person Narratives of the American South had
8,168,336 total words
73,000 distinct word forms
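As a rough illustration of what these two figures measure, the sketch below counts total words and distinct word forms in a short made-up sentence. The tokenizer (lowercasing, then keeping runs of letters and apostrophes) and the function name corpus_summary are my own assumptions for illustration; Voyant's actual tokenization may differ.

```python
import re
from collections import Counter

def corpus_summary(text, top_n=3):
    """Return (total words, distinct word forms, most frequent words)."""
    # Assumption: a "word" is a run of letters or apostrophes, lowercased.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    return len(tokens), len(counts), counts.most_common(top_n)

# A made-up sample sentence, not taken from either corpus.
sample = "The war came, and the war went; the people stayed."
total, distinct, frequent = corpus_summary(sample)
print(total, distinct, frequent)
```

Here "the" and "war" each count once as distinct word forms even though they repeat, which is why the distinct count is always at or below the total.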
Reading these documents directly would take a long time given their length and written form. Instead, one can read them distantly, analyzing the most frequent and most distinctive words to get a sense of their contents.
Read distantly, the most frequent keywords suggest that the documents are reports, journal entries, or narratives, which is what they are.
A graph that tracks the use of these keywords in each document gives the reader a visual sense of that document's nature.
The distinctive words make these journal or narrative entries easier to understand, since they consist largely of names that appear within the documents. Reading by distinctive words is much simpler in this case, because someone hunting for a certain person can easily locate them and their story. You can determine the nature of a document from its common phrases, context, and word count without even reading it.
Additionally, Voyant links frequently used terms to one another, making it easier to see how those keywords relate to other documents and their contents.
The U.S. President’s Inaugural Speeches
The U.S. President’s Inaugural Speeches had
134,770 total words
9,305 unique word forms
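One way to compare the two corpora is the ratio of unique word forms to total words, often called vocabulary density. The sketch below computes it from the figures reported in this post. The function name vocabulary_density is my own, and the 73,000 figure looks rounded, so treat the first result as approximate.

```python
# Word counts as reported in this post: (total words, unique word forms).
corpora = {
    "First-Person Narratives of the American South": (8_168_336, 73_000),
    "U.S. President's Inaugural Speeches": (134_770, 9_305),
}

def vocabulary_density(total_words, unique_forms):
    """Ratio of unique word forms to total words."""
    return unique_forms / total_words

for name, (total_words, unique_forms) in corpora.items():
    density = vocabulary_density(total_words, unique_forms)
    print(f"{name}: {density:.4f}")
```

This gives roughly 0.009 for the narratives and 0.069 for the inaugural speeches. The ratio tends to fall as a corpus grows, so the gap says as much about the difference in size as about the variety of vocabulary.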
On their own, the corpus’s frequent and distinctive terms suggest that it is legal and possibly political in nature.
I concluded that Voyant makes it simpler to evaluate and comprehend large-scale texts and documents at a distance. Because of the time saved, users can learn more. I generally take the initiative to learn new things, but reading lengthy documents is really difficult for me, since I spend so much time deciphering the words instead of grasping the content of the documents as a whole. Voyant makes it simpler to comprehend a document’s message when reading at a distance.