Spaces:
Sleeping
Sleeping
| license: mit | |
| title: Tweet-Wayback | |
| sdk: streamlit | |
| emoji: ๐ | |
| colorFrom: yellow | |
| colorTo: indigo | |
| app_file: app.py | |
| # Wayback Tweets | |
| [](https://doi.org/10.5281/zenodo.12528448) [](https://pypi.org/project/waybacktweets) [](https://github.com/claromes/waybacktweets/actions/workflows/docs.yml) [](https://waybacktweets.streamlit.app) | |
| Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML (for easy viewing of the tweets using the `iframe` tag), CSV, and JSON formats. | |
| ## Installation | |
| ```shell | |
| pip install waybacktweets | |
| waybacktweets --from 20150101 --to 20191231 --limit 250 jack | |
| from waybacktweets import WaybackTweets, TweetsParser, TweetsExporter | |
| USERNAME = "jack" | |
| api = WaybackTweets(USERNAME) | |
| archived_tweets = api.get() | |
| if archived_tweets: | |
| field_options = [ | |
| "archived_timestamp", | |
| "original_tweet_url", | |
| "archived_tweet_url", | |
| "archived_statuscode", | |
| ] | |
| parser = TweetsParser(archived_tweets, USERNAME, field_options) | |
| parsed_tweets = parser.parse() | |
| exporter = TweetsExporter(parsed_tweets, USERNAME, field_options) | |
| exporter.save_to_csv() |