Mark Johnston
2014-11-16 19:41:34 UTC
Hello,
just a quick email, I've recently started a port of Open NLP to the .NET
framework, and have just created the first commit on github, in case anyone
is interested.
From the Readme:
"opennlp4net is a port of OpenNLP to the .NET framework. The code is
written in C#, and is very much a work-in-progress. The code in this
library should not even be considered alpha, and only exists because there
are no other open source natural language tools available for .NET.
The code was initially ported using an automated tool for the mechanical
conversion, then attempts were made to fix functionality piece by piece,
function by function, by replacing JAVA-isms with .NET equivalents. The
result is not pretty, but a number of the tools work via the API:
the sentence detector
the tokenizer
the namefinder
the chunker
the postagger
The parser does not work currently, neither does any functionality relating
to training the models."
The repository is at: https://github.com/quicquam/opennlp4net.
If anyone wants to help me to get this port up to speed, please contact me
via this forum. The solution includes models (which the port uses
unconverted), and test projects which have an absolute minimal test set for
the tools listed above.
I am currently trying to get the parser tool working, but having issues I
think related to the fact that the model file is nested, and the code that
I have written can't cope with that yet.
Cheers,
Mark
just a quick email, I've recently started a port of Open NLP to the .NET
framework, and have just created the first commit on github, in case anyone
is interested.
From the Readme:
"opennlp4net is a port of OpenNLP to the .NET framework. The code is
written in C#, and is very much a work-in-progress. The code in this
library should not even be considered alpha, and only exists because there
are no other open source natural language tools available for .NET.
The code was initially ported using an automated tool for the mechanical
conversion, then attempts were made to fix functionality piece by piece,
function by function, by replacing JAVA-isms with .NET equivalents. The
result is not pretty, but a number of the tools work via the API:
the sentence detector
the tokenizer
the namefinder
the chunker
the postagger
The parser does not work currently, neither does any functionality relating
to training the models."
The repository is at: https://github.com/quicquam/opennlp4net.
If anyone wants to help me to get this port up to speed, please contact me
via this forum. The solution includes models (which the port uses
unconverted), and test projects which have an absolute minimal test set for
the tools listed above.
I am currently trying to get the parser tool working, but having issues I
think related to the fact that the model file is nested, and the code that
I have written can't cope with that yet.
Cheers,
Mark