Performance Notes
Although Textanz uses the best known text processing algorithms, calculation procedures can be time consuming. From mathematical point of view both memory and duration of the algorithm are almost proportional to the size of the text. However, there are some other factors that influence Textanz functioning, e.g. the consistence of text.
The most complicated and longest operation is phrase frequency calculation. When only frequencies of single words are needed, we recommend Concordance tab.
For large text corpus (hundreds of pages) we suggest first to start the frequency calculation and see how quick it goes. Progress bar will help to make the rough estimate of required time. If operation seems to be too lengthy , you can break text into smaller parts and then join results in Excel using import. At the same time, such approach implies a risk of losing some frequencies , so it is not recommended when precise information are needed.
Textanz can reuse some internal data for further operations, if text and certain settings has not been changed. For example, changing the "minimal frequency" or "exclude ignored words" settings does not require full recalculation.
|