The zero-based index of the document in the input list.
The syntax tokens for the words in the document, one token for each word.
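
As a minimal sketch of how these two fields might be used together, the example below matches each result back to its source document through the zero-based index. The class names (`SyntaxToken`, `DocumentResult`), the token fields (`text`, `part_of_speech`), and the helper `pair_results_with_inputs` are hypothetical and chosen only for illustration; the actual token structure is not described here.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class SyntaxToken:
    # Hypothetical token shape; the real token fields are not given in the text.
    text: str
    part_of_speech: str


@dataclass
class DocumentResult:
    # Zero-based index of the document in the input list.
    index: int
    # One token for each word in the document.
    syntax_tokens: List[SyntaxToken] = field(default_factory=list)


def pair_results_with_inputs(
    documents: List[str], results: List[DocumentResult]
) -> List[Tuple[str, List[SyntaxToken]]]:
    """Match each result to its source document via the zero-based index.

    Results are sorted by index, so the output follows input order even if
    the results arrive in a different order.
    """
    ordered = sorted(results, key=lambda r: r.index)
    return [(documents[r.index], r.syntax_tokens) for r in ordered]


documents = ["It rains", "Birds sing"]
# Results deliberately listed out of order to show why the index matters.
results = [
    DocumentResult(1, [SyntaxToken("Birds", "NOUN"), SyntaxToken("sing", "VERB")]),
    DocumentResult(0, [SyntaxToken("It", "PRON"), SyntaxToken("rains", "VERB")]),
]
paired = pair_results_with_inputs(documents, results)
```

Because the index refers to a position in the input list rather than to the text itself, this approach still works when the same document appears more than once in the input.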