University of Toronto


Anglo-Saxon

Each of the approximately 1400 documents are represented in a 3-dimensional vector space (four is the number of topics) where the value of the first coordinate point is the percentage share of document of the first topic, the second coordinate point is the percentage share of document of the second topic, etc. We chose the number of topics as four through trial and error. This choice of four topics resulted in the best thematic clustering within the text

The 3-dimensional vector representation of each of the documents is projected onto a Cartesian plane (via Multidimensional Scaling) in such a way as to preserve, as much as possible, the distances between pairs of documents in the 3-dimensional space. To view the document projections from the DEEDS corpus and corresponding metadata, click below:

Using the document ID number, search below for the document's topic mixture and its metadata information

   Example: 3450001

Search Result:


Topic Mixture:






* 'Internal' indicates date is included in the text.
.. 'External' indicates date is provided by editor based on textual context.


Home Page Main Page