Count the frequency of letters or words in a text file and show a graph.

Last update: Apr 09, 2022

Overview

Word Counter

By EBUS Coding Club

Count the frequency of letters or words in a text file and show a graph.

Requirements

Python 3.9 or higher
matplotlib

Usage

Download the source code and unzip the downloaded file. Run pip install -r requirements.txt in the source code directory to install the required packages. Create a text file in the same directory as main.py named input.txt and fill it with text you want to analyze. Run the script in an IDE of your choice or with python main.py.

Objective

Given a text file, count the frequency (number of occurrences) of either letters or words, and show a bar graph to visualize the results. Do not include whitespace or punctuation in the results, with the exception of apostrophes that are inside words.

Next Steps

Add command line arguments for input file path and other options
Add timers for significant steps to diagnose performance
Optimize speed and memory usage
Anything else you can think of to improve the script

License

MIT License

Count the frequency of letters or words in a text file and show a graph.

Related tags

Overview

Word Counter

Requirements

Usage

Objective

Next Steps

License

Owner

EBUS Coding Club

🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

FactSumm: Factual Consistency Scorer for Abstractive Summarization

KoBERTopic은 BERTopic을 한국어 데이터에 적용할 수 있도록 토크나이저와 BERT를 수정한 코드입니다.

OceanScript is an Esoteric language used to encode and decode text into a formulation of characters

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

Repositório do trabalho de introdução a NLP

Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3

Kinky furry assitant based on GPT2

Wind Speed Prediction using LSTMs in PyTorch

End-to-End Speech Processing Toolkit

voice2json is a collection of command-line tools for offline speech/intent recognition on Linux

Unsupervised text tokenizer focused on computational efficiency

Backend for the Autocomplete platform. An AI assisted coding platform.

Codes for processing meeting summarization datasets AMI and ICSI.

Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"

Code for the paper "Flexible Generation of Natural Language Deductions"

🏆 • 5050 most frequent words in 109 languages

Need: Image Search With Python

Using context-free grammar formalism to parse English sentences to determine their structure to help computer to better understand the meaning of the sentence.