In v10 of europarl we add metadata as extra columns in the tsv file. The columns are: source target (not in monolingual) file_id chapter_id speaker_id speaker name language affiliation Some fields may be blank. There has been no extra data added to europarl since v7