  • @arrowplum

    Hey, thanks for the reply. No idea about the right thread. I just searched for programming and posted.

    Re: 1.1, the data is not stored in one place/space, unfortunately. It will need to be pulled out of the entry (which reads like any biography would - random). This is why I thought the natural language toolkit may be helpful.

    Re: 4, no - the data will be output into a txt file which can be used with existing statistical analysis software.

    Ultimately, I'm just trying to make my life easier in terms of getting the data. However, it is a skill which could become useful in the future (I can already think of one project where I would have saved myself an afternoon if I could have worked out something that automatically strips particular fields out of text).

  • Ahh yeah. In that case, like you suspected, Python is probably your man. It sounds like the pain of learning it would be useful to you in general (i.e. this isn't just a one-off).

    That natural language stuff is still only useful to you if you need to figure out things from context (I think).

    If you have the time, I'd point you in the direction of: http://learnpythonthehardway.org/book
    It's a good start, especially if you aren't already programming in another language.

  • Oh also, it isn't perfect but this thread http://www.lfgss.com/conversations/242694/ at least has a lot of people on it who could help you too.
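
For anyone landing here with the same problem, here is a rough Python sketch of the "pull fields out of free text and dump them to a txt file" idea discussed above. It uses plain regular expressions rather than the natural language toolkit, which is a swapped-in simplification that only works when the wording around each field is fairly predictable. The field names (`born`, `died`), the patterns, and the helper names `extract_fields` and `write_rows` are all made up for illustration; the real entries will need their own patterns.

```python
import csv
import re

# Hypothetical fields and patterns: each one grabs the first four-digit year
# that follows the keyword within the same sentence. Adjust to your own data.
PATTERNS = {
    "born": re.compile(r"\bborn\b[^.]*?\b(\d{4})\b", re.IGNORECASE),
    "died": re.compile(r"\bdied\b[^.]*?\b(\d{4})\b", re.IGNORECASE),
}


def extract_fields(entry):
    """Return a dict of field name -> first match, or '' if nothing matched."""
    row = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(entry)
        row[name] = match.group(1) if match else ""
    return row


def write_rows(entries, path):
    """Write one tab-separated line per entry, with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=list(PATTERNS), delimiter="\t")
        writer.writeheader()
        for entry in entries:
            writer.writerow(extract_fields(entry))
```

Calling `write_rows(list_of_entries, "output.txt")` should give a tab-separated txt file that most statistics packages can import. Entries where a pattern finds nothing get an empty cell, so it is worth checking those by hand.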
