The HourGlass corpus is a collection of 348 documents (short texts) in Spanish, annotated with temporal expressions following the TimeML standard. Since it was conceived as a test bed for temporal taggers, each document is also classified with a tag and a register.
The corpus is divided into two parts, depending on the source of the texts.
Additionally, we make available the output of three different temporal taggers on this corpus. The metrics obtained by these taggers (computed with the GATE software) against the key annotation set, for each file and each feature (extent, type and value of each tag), can be found next to each tagger below:
These files can be loaded into GATE as a corpus to facilitate visualization and comparison. GATE is also the software that generated the statistics above.
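For readers who want a rough check outside GATE, the following minimal sketch shows how precision, recall and F1 could be computed per feature (extent, type, value) from two TimeML-tagged versions of the same text. It is not GATE's implementation: the names `Timex`, `extract_timexes` and `prf` are hypothetical, the matching is strict (exact offsets), and the sketch assumes that `TIMEX3` is the only inline markup and that the tags are not nested. GATE's own figures may differ, for example if lenient (overlapping) matches are counted.

```python
import re
from typing import NamedTuple


class Timex(NamedTuple):
    start: int   # offset in the untagged text
    end: int
    type: str    # TIMEX3 "type" attribute, e.g. DATE
    value: str   # TIMEX3 "value" attribute, e.g. 2017-03-05


TIMEX_RE = re.compile(r"<TIMEX3\b([^>]*)>(.*?)</TIMEX3>", re.DOTALL)
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')


def extract_timexes(tagged: str) -> list[Timex]:
    """Collect TIMEX3 annotations with offsets into the untagged text.

    Assumption: TIMEX3 is the only inline markup and tags are not nested.
    """
    result = []
    removed = 0  # markup characters seen before the current match
    for m in TIMEX_RE.finditer(tagged):
        attrs = dict(ATTR_RE.findall(m.group(1)))
        start = m.start() - removed
        end = start + len(m.group(2))
        result.append(Timex(start, end, attrs.get("type", ""), attrs.get("value", "")))
        opening_tag_len = m.start(2) - m.start()
        removed += opening_tag_len + len("</TIMEX3>")
    return result


def prf(key: list[Timex], response: list[Timex], feature: str) -> tuple[float, float, float]:
    """Strict precision, recall and F1 for 'extent', 'type' or 'value'."""
    project = {
        "extent": lambda t: (t.start, t.end),
        "type": lambda t: (t.start, t.end, t.type),
        "value": lambda t: (t.start, t.end, t.value),
    }[feature]
    key_set = {project(t) for t in key}
    response_set = {project(t) for t in response}
    correct = len(key_set & response_set)
    precision = correct / len(response_set) if response_set else 0.0
    recall = correct / len(key_set) if key_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Invented example sentence, not taken from the corpus.
    key_doc = ('Vino <TIMEX3 tid="t1" type="DATE" value="2017-03-05">el 5 de marzo de 2017</TIMEX3> '
               'y se fue <TIMEX3 tid="t2" type="DURATION" value="P2D">dos días</TIMEX3> después.')
    tagger_doc = key_doc.replace('value="P2D"', 'value="P2W"')  # simulated tagger error
    key, response = extract_timexes(key_doc), extract_timexes(tagger_doc)
    for feature in ("extent", "type", "value"):
        print(feature, prf(key, response, feature))
```

A type or value match here also requires the extent to match exactly, which is one possible convention among several.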
The corpus is freely downloadable under the GNU General Public License v3.0.
If you plan to publish work that uses this resource, please refer to this webpage (and come back in a few months; hopefully we will have a paper to refer to!).
We would also like to thank the contributors to the PEOPLE part of the corpus.