View Single Post
Posts: 102 | Thanked: 187 times | Joined on Jan 2010
#14
Originally Posted by FlyingAntero View Post
OK, now I got it. Text corpus makes sence for prediction. I have access to text corpus data which is about 60Gb. Is that enought or should I try to search bigger one? The data that I have found is in different zip files.

EDIT: Here is more information about the data. It is in VRT file format:I would be really grateful for help since my programming skills are very limited.
That sounds like a good start.Then we can see if we need more data from our partner Kielipankki. Make sure to include some social media resources too in the first batch. Multiple source files are no problem. Concatenate them if it feels easier to handle a single source for you.
No problem, happy to help.
 

The Following 3 Users Say Thank You to ljo For This Useful Post: