A number of methods in order to perform Natural Language Processing on live data derived from Twitter

Last update: Nov 24, 2021

Related tags

Overview

Twitter_NLP

Link to Project: https://twitoff-amadou.herokuapp.com/

==Description==

This project integrates a number of methods in order to perform Natural Language Processing (NLP) on live data derived from Twitter. The goal of this project is to demonstrate how NLP can be used at a basic level to classify hypertext by which Twitter user is most likely to 'tweet' (or post) it. For this project, Twitter API access had been granted, and implemented with the Tweepy wrapper for python.

To start, the web app it built using the Flask platform and is deployed on Heroku. For the functionality of the project, data is extracted from Twitter using its API and the Tweepy library and is fed into SQLAlchemy tables. These tables which hold a variety of information we're concerned with, such as the usernames and past tweeting data, are integrated with our PostgreSQL database. The Spacy library is then responsible for vectorizing our tweets into components our models can operate on. Finally, a random forest classifier is tasked with receiving and training on these vectors.

The interface of the app is quite intuitive. There are two text boxes, one labeled "User to add" and the other, "Tweet text to predict". The user is expected to type a name into the 'add' box, such that Tweepy can add the respective twitter user(s) and their tweeting data to our PostgreSQL database. Our random forest will then train live on the inputted values. Once this has been accomplished with at least two Twitter users in the database, one can add text into the 'predict' box, select the two users they wish to compare and let our model produce a result.

A number of methods in order to perform Natural Language Processing on live data derived from Twitter

Related tags

Overview

Twitter_NLP

==Description==

Owner

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks

Weird Sort-and-Compress Thing

A natural language modeling framework based on PyTorch

Auto_code_complete is a auto word-completetion program which allows you to customize it on your needs

A curated list of FOSS tools to improve the Hacker News experience

Suite of 500 procedurally-generated NLP tasks to study language model adaptability

PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"

PortaSpeech - PyTorch Implementation

Ukrainian TTS (text-to-speech) using Coqui TTS

This project uses unsupervised machine learning to identify correlations between daily inoculation rates in the USA and twitter sentiment in regards to COVID-19.

AI-powered literature discovery and review engine for medical/scientific papers

Simple, Fast, Powerful and Easily extensible python package for extracting patterns from text, with over than 60 predefined Regular Expressions.

뉴스 도메인 질의응답 시스템 (21-1학기 졸업 프로젝트)

nlabel is a library for generating, storing and retrieving tagging information and embedding vectors from various nlp libraries through a unified interface.

Application to help find best train itinerary, uses speech to text, has a spam filter to segregate invalid inputs, NLP and Pathfinding algos.

Malware-Related Sentence Classification

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

PyTorch implementation of Tacotron speech synthesis model.

2021搜狐校园文本匹配算法大赛baseline