{"id":311,"date":"2017-07-11T05:04:34","date_gmt":"2017-07-11T05:04:34","guid":{"rendered":"http:\/\/eurekadata.net\/?page_id=311"},"modified":"2017-07-26T16:53:09","modified_gmt":"2017-07-26T16:53:09","slug":"terms","status":"publish","type":"page","link":"https:\/\/eurekadata.net\/index.php\/introduction\/terms\/","title":{"rendered":"Terms and Concepts"},"content":{"rendered":"<p>The following are used frequently throughout this documentation.<\/p>\n<ul>\n<li><em>Source of Truth\u00a0<\/em>refers to the concept of data origin, or authority for which all the derivatives, the digests and meta data are based. Though not necessarily constructed of singular system, nor necessarily immutable store, the source of truth would be distinguished from all the other collections of data as an arbiter where multiple copies of essentially the same data disagree and resource for recovery or corrections of \u00a0such discrepancies. \u00a0Though present in most big data systems Eureka would largely downplay the need for this concept of\u00a0<em>source of truth\u00a0<\/em>by\u00a0minimizing the need for the multiple and frequently conflicting derivatives.<\/li>\n<li><em>Corpus<\/em>\u00a0is a collection of works or text, which when ingested into a Eureka data system it is distilled down to a set of statistics that conserves all the information in that collection of text while also exposing all the relevant statistics and join tables for the quickest operations.<\/li>\n<li><em>Join(ing)<\/em>\u00a0means the connection of different data concepts directly. \u00a0Internally it is as though you have random or direct reference to any one value, or one of a set of value, or a range within a set of elements.<\/li>\n<li><em>Lexing<\/em>,\u00a0<em>Token<\/em>\u00a0and\u00a0<em>Space<\/em>\u00a0are the process of ingesting and differentiating data into different classes or types. \u00a0As in reading byte stream of input and transforming that to tokens with annotations such as words, spaces, and delimiters etc. \u00a0The\u00a0<em>space\u00a0<\/em>concept is that its convenient to consider at times to constrain tokens to one class of types such as only words, or only delimiters. \u00a0As tokens are such a central concept in Eureka that\u00a0<a href=\"http:\/\/eurekadata.net\/index.php\/tokens\/\">a large section of this documentation<\/a>\u00a0is dedicated to it.<\/li>\n<li><em>Direct<\/em>(<em>ly<\/em>) or\u00a0<em>Random Access<\/em>\u00a0is the concept of that machine complexity (or order) for a lookup. \u00a0Random access is literally constant time (<em><strong>O<\/strong><\/em>(<em>k<\/em>)). \u00a0Eureka\u2019s access is based on\u00a0<em><strong>O<\/strong><\/em>(log<sub>k<\/sub>(<em>n<\/em>)), where\u00a0<em>k<\/em>\u00a0&gt;&gt; 2, while greater than\u00a0<strong><em>O<\/em><\/strong>(<em>k<\/em>) its approximately and practically equivalent even with large\u00a0<strong><em>n.<\/em><\/strong><\/li>\n<li><em>Magic<\/em>\u00a0as in special refers to the special, frequently unique, character sequences that are integrated into data stream that is the corpus. \u00a0Quite probably, is a temporary crutch, this mechanism permits the current design to proceed until a better solution possibly multi channel data or out of band information is provided.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>The following are used frequently throughout this documentation. Source of Truth\u00a0refers to the concept of data origin, or authority for which all the derivatives, the digests and meta data are based. Though not necessarily constructed of singular system, nor necessarily immutable store, the source of truth would be distinguished from all the other collections of &hellip; <a href=\"https:\/\/eurekadata.net\/index.php\/introduction\/terms\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Terms and Concepts<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":238,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/311"}],"collection":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/comments?post=311"}],"version-history":[{"count":4,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/311\/revisions"}],"predecessor-version":[{"id":386,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/311\/revisions\/386"}],"up":[{"embeddable":true,"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/pages\/238"}],"wp:attachment":[{"href":"https:\/\/eurekadata.net\/index.php\/wp-json\/wp\/v2\/media?parent=311"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}