Describe: N-gram-based content indexing