Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a parser for tweet archives #15

Open
serin-delaunay opened this issue Nov 6, 2017 · 0 comments
Open

Add a parser for tweet archives #15

serin-delaunay opened this issue Nov 6, 2017 · 0 comments

Comments

@serin-delaunay
Copy link
Owner

User ebooks accounts are a common type of bot. They generate (presumably Markov) text based on a user's tweet archives. CMTQ can parse uploaded files, but its current line-separation method is not suited to extracting tweet text from tweet archive files.

  • Add a way to upload a tweet archive file (a second file upload button?)
  • Parse tweet archive files to strip out metadata
  • Use each tweet's text as a line of source text.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant