Text this: N-gram-based content indexing