Welcome! Here is how you can find something:
There are lots of other options! Click at the top to find out.
National Corpus of the Tajik Language (tajik-corpus.org).
If you want to share your query with someone, send them the text you see below. The person who loads this query will see the same results in the same order, unless the corpus has been re-indexed.
Here you can load a corpus query someone sent you. Please enter the query below:
| Value | Size in words | Number of documents |
| Query word 1 |
| Frequency (ipm) | 90% confidence interval |
The plot below works as follows. The x axis shows frequency ranks, i.e. positions in the full list of all word forms or lemmata of the corpus, ordered by decreasing frequency. If several words or lemmata have the same frequency, they share a single rank equal to the average of their positions. For each frequency rank r, the y axis shows the proportion of words or lemmata that match your query (each one counted only once) among all words or lemmata with frequency rank less than or equal to r. The rightmost point therefore shows the overall proportion of matching words or lemmata among all types (distinct words) in the corpus. For lemmata, every lemma that has at least one word form matching the query is counted.
The subcorpus constraints and all words in the query, except the first one, are not taken into account here.
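The computation behind the plot can be sketched roughly as follows. This is only an illustration under stated assumptions, not the corpus platform's actual code: the function names `average_ranks` and `proportion_curve`, the toy frequency list, and the `matches` predicate are all hypothetical.

```python
def average_ranks(freqs):
    """Assign a rank to every word; words with equal frequency share
    the average of the positions they occupy (hypothetical helper)."""
    ordered = sorted(freqs, key=lambda w: -freqs[w])
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j to the end of the group of words tied with ordered[i].
        while j < len(ordered) and freqs[ordered[j]] == freqs[ordered[i]]:
            j += 1
        # The tied words occupy 1-based positions i+1 .. j.
        avg = (i + 1 + j) / 2
        for w in ordered[i:j]:
            ranks[w] = avg
        i = j
    return ranks


def proportion_curve(freqs, matches):
    """Return (rank, proportion) points: for each rank r, the share of
    types with rank <= r that satisfy the `matches` predicate."""
    ranks = average_ranks(freqs)
    points = []
    for r in sorted(set(ranks.values())):
        in_range = [w for w in ranks if ranks[w] <= r]
        hits = sum(1 for w in in_range if matches(w))
        points.append((r, hits / len(in_range)))
    return points


# Toy data (assumed): four types, two of which match the query.
toy_freqs = {"a": 10, "b": 5, "c": 5, "d": 1}
curve = proportion_curve(toy_freqs, lambda w: w in {"b", "d"})
for rank, share in curve:
    print(rank, round(share, 3))
```

In this toy example "b" and "c" are tied and both receive rank 2.5, and the last point of the curve equals the overall share of matching types (2 out of 4).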