Metadata-Version: 2.1
Name: preprocessingtweet
Version: 0.1.2
Summary: Preprocessing tweets prior to use in Transformers
Author: Olivier Philippe
Author-email: olivier.philippe@gmail.com
Requires-Python: >=3.10,<3.12
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Requires-Dist: click (>=8.1.3,<9.0.0)
Requires-Dist: emot (>=3.1,<4.0)
Requires-Dist: gensim (>=4.3.0,<5.0.0)
Requires-Dist: joblib (>=1.2.0,<2.0.0)
Requires-Dist: nltk (>=3.8.1,<4.0.0)
Requires-Dist: numpy (>=1.24.1,<2.0.0)
Requires-Dist: regex (>=2022.10.31,<2023.0.0)
Requires-Dist: scipy (>=1.10.0,<2.0.0)
Requires-Dist: six (>=1.16.0,<2.0.0)
Requires-Dist: tqdm (>=4.64.1,<5.0.0)
Description-Content-Type: text/markdown

# dataPreprocessing

Parsing tweet object to lower and clean text. Remove entities (RT, Hashtags, mentions, urls) from the text (can keep the hashtags) and can replace them with placeholder.
Do some basic retweet detection to return if it is a Retweet or no.



