fathan commited on
Commit
396e92c
1 Parent(s): 7d983c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -30,12 +30,14 @@ The following are few examples of the keyword phrases:
30
  - great kalian semua
31
  - chattingane ilang
32
  - baru aja launching
 
33
  We acquire 40,788,384 raw tweets. We apply first stage pre-processing tasks such as:
34
  - remove duplicate tweets,
35
  - remove tweets with token length less than 5,
36
  - remove multiple space,
37
  - convert emoticon,
38
  - convert all tweets to lower case.
 
39
  After the first stage pre-processing, we obtain 17,385,773 tweets.
40
  In the second stage pre-processing, we do the following pre-processing tasks:
41
  - split the tweets into sentences,
 
30
  - great kalian semua
31
  - chattingane ilang
32
  - baru aja launching
33
+
34
  We acquire 40,788,384 raw tweets. We apply first stage pre-processing tasks such as:
35
  - remove duplicate tweets,
36
  - remove tweets with token length less than 5,
37
  - remove multiple space,
38
  - convert emoticon,
39
  - convert all tweets to lower case.
40
+
41
  After the first stage pre-processing, we obtain 17,385,773 tweets.
42
  In the second stage pre-processing, we do the following pre-processing tasks:
43
  - split the tweets into sentences,