Legal Docs Portal |
|
Do you have Documents in Akoma Ntoso XML format? Our portal software is Akoma Ntoso XML aware and supports managing, indexing and searching Akoma Ntoso documents. See our online demo.
Bungeni Consulting's specialized parser mapped to the Akoma Ntoso XML standard allows easy conversion of plain text, word or PDF documents into Akoma Ntoso XML documents. Our design also supports entry points to plug-in custom processing during the conversion process; the text document is converted to a semantic JSON format which can be processed further if need be by intermediate custom processing before final conversion to XML.
Bungeni Consulting's specialized parser mapped to the Akoma Ntoso XML standard allows easy conversion of plain text, word or PDF documents into Akoma Ntoso XML documents. Our design also supports entry points to plug-in custom processing during the conversion process; the text document is converted to a semantic JSON format which can be processed further if need be by intermediate custom processing before final conversion to XML.
STEP 1. Process Text
We prepare the text to a consistent format, for the converter. |
STEP 2. Semantic JSON
We parse the text into a semantic JSON object which can be processed further if desired. |
STEP 3. Akoma Ntoso XML
The Semantic JSON is converted to Akoma Ntoso XML. |
Click here for a live demo of the parser
Our parser converter design allows using the same basic parser to handle text from different languages. This allows us to identify parse and identify semantic metadata in the text across multi-language documents.
For instance in an english document a date appears as "12th January 2011", and in the corresponding Spanish translation of the document the date may appear as "12 de enero de 2011", but both need to identified as a semantic machine readable date "2011-01-12"; similarly the city Johannesburg appears as "Johannesburg" in an English document, and in the corresponding Portuguese document as "Joanesburgo", and "جوهانسبرج" in the Arabic translation. All three of these names need to be semantically tagged as the same location "Johannesburg" - the parser identifies such semantics across languages.
For instance in an english document a date appears as "12th January 2011", and in the corresponding Spanish translation of the document the date may appear as "12 de enero de 2011", but both need to identified as a semantic machine readable date "2011-01-12"; similarly the city Johannesburg appears as "Johannesburg" in an English document, and in the corresponding Portuguese document as "Joanesburgo", and "جوهانسبرج" in the Arabic translation. All three of these names need to be semantically tagged as the same location "Johannesburg" - the parser identifies such semantics across languages.