Ordered sequence. An ordered sequence over this multiset is also possible. An order over these, corresponding to word order in the document, can also be imposed.

A reasonable simplification is to assume that the word’s position within the document does not affect its conditional probability:



When we become interested in realistic document structures and writing conventions (e.g., abstract paragraphs, introductions and conclusions, spiral expositions of news stories (cf. Section 6.2)), etc., this assumption must be reconsidered.