Project:World News: Difference between revisions
Start Tag: wikieditor |
GitHub link Tag: wikieditor |
||
| Line 1: | Line 1: | ||
One of the seed datasets for Internet Domains Wikibase, responsible for hundreds of thousands of entries, is the '''World News Project'''. This was a 2019 Google Summer of Code project pursued by Lavanya Singh at the Internet Archive to identify online news services on a worldwide basis, broadly construed. | One of the seed datasets for Internet Domains Wikibase, responsible for hundreds of thousands of entries, is the '''[https://github.com/lsingh123/GSC2019worldnewsproject World News Project]'''. This was a 2019 Google Summer of Code project pursued by Lavanya Singh at the Internet Archive to identify online news services on a worldwide basis, broadly construed. | ||
The World News project drew from a number of sources. These sources are referred to with placeholders throughout the Internet Domains dataset. They correspond to the following services: | The World News project drew from a number of sources. These sources are referred to with placeholders throughout the Internet Domains dataset. They correspond to the following services: | ||
Latest revision as of 00:44, 26 September 2025
One of the seed datasets for Internet Domains Wikibase, responsible for hundreds of thousands of entries, is the World News Project. This was a 2019 Google Summer of Code project pursued by Lavanya Singh at the Internet Archive to identify online news services on a worldwide basis, broadly construed.
The World News project drew from a number of sources. These sources are referred to with placeholders throughout the Internet Domains dataset. They correspond to the following services:
- abyznewslinks: http://www.abyznewslinks.com
- commoncrawl: https://commoncrawl.org
- datastreamer
- dmoz: https://dmoz-odp.org
- gdelt: https://www.gdeltproject.org
- google: https://news.google.com
- inkdrop: https://inkdrop.net/news
- lion: https://www.lionpublishers.com
- mediacloud: https://www.mediacloud.org
- newscrawl
- newscrawls
- newsgrabber: https://wiki.archiveteam.org/index.php/NewsGrabber
- onlineradiobox: https://onlineradiobox.com/us
- original
- prensaescrita: https://www.prensaescrita.com
- ranker
- topnews
- top_news
- usnpl: https://www.usnpl.com
- w3newspapers: https://www.w3newspapers.com
- w3newspapers.com: https://www.w3newspapers.com
- wikidata: https://wikidata.org
- wikinews: https://wikinews.org
- wikipedia: https://wikipedia.org
- wikkipedia: https://wikipedia.org