livctr commited on
Commit
c7611cd
·
1 Parent(s): 5b6b5b1

Modify comments

Browse files

Former-commit-id: 01c3f9a8e04cb0917a510f1774a1a647b69f994c

data_pipeline/conference_scraper.py CHANGED
@@ -1,4 +1,4 @@
1
- """Scrape data from some famous ML conferences and saves into `DataPaths.CONFERENCE_DIR`.
2
 
3
  Every scrape function returns a list of 3-lists of the form
4
  [paper_title, paper_authors, paper_url].
@@ -16,13 +16,12 @@ CVPR: 2023, 2024
16
 
17
  Disclaimer
18
  -----------
19
- The choice of conferences was sourced from here:
20
  https://www.kaggle.com/discussions/getting-started/115799
21
 
22
- The priority of including certain conferences and tracks was based on a 1st-year PhD's
23
- judgment. Some very top conferences were excluded due to higher activation energy to
24
- scrape data and/or the ignorance of the 1st-year PhD. Some notable exceptions include
25
- ICLR, ICCV, ECCV, ACL, NAACL, and many others.
26
  -----------
27
  """
28
 
 
1
+ """Scrape data from some famous ML conferences and save to `DataPaths.CONFERENCE_DIR`.
2
 
3
  Every scrape function returns a list of 3-lists of the form
4
  [paper_title, paper_authors, paper_url].
 
16
 
17
  Disclaimer
18
  -----------
19
+ The choice of conferences is sourced from here:
20
  https://www.kaggle.com/discussions/getting-started/115799
21
 
22
+ The inclusion of certain conferences and tracks is based on arbitrary judgment. Some
23
+ very top conferences were excluded due to difficulty in scraping data and limited
24
+ time. Some notable exceptions include ICLR, ICCV, ECCV, ACL, NAACL, and many others.
 
25
  -----------
26
  """
27