The most used research materials in the Language Bank of Finland are
- The Finnish Text Collection ftc,
- The Finnish-Swedish Text Collection fstc and
- The Helsinki Corpus of Swahili hcs.
The text can be found on the server hippu.csc.fi and, for the most part, through CSC's Scientist's Interface query tool Lemmie.
All the available materials are listed on the page Software and databases.
Frequency lists
A frequency list of the 9996 most common lemmas in Finnish newspaper text can be downloaded below. The frequency list is freely available for research according to the Creative Commons license linked on the download page.
Other frequency lists are available on the server hippu.csc.fi. Other frequency lists available on the web include the list published by the Finnish Research Institute for Languages here.