{"id":247,"date":"2017-07-10T15:45:01","date_gmt":"2017-07-10T15:45:01","guid":{"rendered":"http:\/\/eurekadata.net\/?page_id=247"},"modified":"2017-10-06T15:47:10","modified_gmt":"2017-10-06T15:47:10","slug":"shakespeare-redux","status":"publish","type":"page","link":"https:\/\/eurekadata.net\/index.php\/shakespeare-redux\/","title":{"rendered":"Shakespeare Redux"},"content":{"rendered":"<p><img loading=\"lazy\" class=\"alignnone size-full wp-image-124\" src=\"http:\/\/eurekadata.net\/wp-content\/uploads\/2017\/07\/shakespeare.gif\" alt=\"William Shakespeare\" width=\"222\" height=\"282\" \/><\/p>\n<h2>Shakespeare Redux<\/h2>\n<p>As an example of a small corpus of natural language the complete works of William Shakespeare as published by Gutenburg Press consist of 1\/8 million lines, just under one million words, and 5.46 megabytes of text are\u00a0<a href=\"http:\/\/shakespeare.mit.edu\">freely available<\/a> from MIT.<\/p>\n<p>Some format is not entirely consistent across this corpus but generally so with the text being largely just Shakespeare&#8217;s words with 170 lines of preamble, 400 lines of licensing post notes and 220 copyright notices interspersed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Shakespeare Redux As an example of a small corpus of natural language the complete works of William Shakespeare as published by Gutenburg Press consist of 1\/8 million lines, just under one million words, and 5.46 megabytes of text are\u00a0freely available from MIT. Some format is not entirely consistent across this corpus but generally so with &hellip; <a href=\"https:\/\/eurekadata.net\/index.php\/shakespeare-redux\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Shakespeare Redux<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/247"}],"collection":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/comments?post=247"}],"version-history":[{"count":4,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/247\/revisions"}],"predecessor-version":[{"id":407,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/247\/revisions\/407"}],"wp:attachment":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/media?parent=247"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}