Filozofická fakultaUniverzita Komenského v Bratislave


User Interface -Query Types

A. searching for a given word, type it in Hanyu pinyin or in Hanzi 汉字

B. searching for a lemma (a word defined in a dictionary), type it in Hanyu pinyin or in Hanzi 汉字

C. searching for a phrase (a small group of words), type it in Hanz 汉字i with the whitespace inbetween

D. searching for a word, type it in Hanyu pinyin or in Hanzi 汉字

E. searching for a Chinese character (morpheme) in a word, type the character

F. searching for a word, phrase, character, use CQL 



Basics 1 - Searching for a word

Searching for a given word, input the searching word (in Hanyu pinyin or as a Hanzi 汉字) and click "Make Concordance", e.g. guniang or 姑娘, xihuan or 喜欢.

See the video here.

Basics 2 - Sorting results

Sorting the results of a given phrase (in this case 漂亮 的) by frequency:

you can sort the results of a given concordance by frequency of words on the right side (1R) from the node. This would produce the frequency list which contains most frequent words (tokens) with the given phrase (漂亮的). 

See the video.

Basics 3 - Searching for a morpheme

Searching for a morpheme (in this case 员), click on the checkbox "E", input the searching morpheme and "Make Concordance". Then click under Frequency on "Node forms".

See the video.


CQL is Corpus Query Language. The video shows how to search for most frequent verbs in a corpus.

CQL: [tag="VV"]

CQL2 – Sentence Pattern

When searching for a sentence pattern, one should write a CQL expression that represents generalized form of the pattern. In this example, we are searching for the sentence pattern with the preposition BEI (passive voice).


[word="被"] []{0,5} [tag="VV"][]{0,5} within <s/>


VV means verbs

within <s/> means within one sentence

[]{0,5} means from zero up to 5 tokens (any tag)


As it is clear, not all concordances match the given sentence pattern. For more details please see the video.

Regex1 – all disyllabic nouns with the prefix 老

A regular expression, regex or regexp is a sequence of characters that define a search pattern. See more at:


In this example, we would like to search for all nouns that begin with a prefix 老. The easiest way is to write a CQL query as follows:

[word="老."&tag="NN"] in which

[word="老." stands for all disyllabic words with the prefix 老, the symbol "." represents any token and

tag="NN"] stands for all nouns, the symbol "&" represents a Boolean operator AND (conjuction).


See the video.

Regex 2 — all disyllabic verbs with the sufix 出

In this example, we would like to search for all disyllabic verbs that have a sufix 出. The easiest way is to write a CQL query as follows: 

[word=".出" & tag="VV"] where:

[word=".出" stands for all disyllabic verbs with the sufix 出, the symbol (wildcard) "." represents any token and

tag="VV"] stands for all verbs, the symbol "&" represents a Boolean operator AND (conjuction).

1. To perform a search, write a CQL query

2. then click on "Node forms".