File:Karsha Annotation Architecture-V-1.1.png: Difference between revisions

From ngfci
Jump to navigation Jump to search
mNo edit summary
No edit summary
Line 1: Line 1:
'''1- Client Web Browser-(Presentation tier)'''
'''1- Client Web Browser-(Presentation tier)'''
The presentation layer provides the application’s user interface (UI).  The Graphical user interfaces are designed with HTML,CSS, JSP, JavaScript, JQuery technologies for smart and easy user interaction.  
The presentation layer provides the application’s user interface (UI).  The Graphical user interfaces are designed with HTML,CSS, JSP, JavaScript, JQuery technologies for smart and easy user interaction.  
'''2-WSo2 Application Server (Logic tier)'''  
'''2-WSo2 Application Server (Logic tier)'''  
This layer controls the application’s functionality by performing detailed processing and all the processes are running inside the WSo2 cloud.
This layer controls the application’s functionality by performing detailed processing and all the processes are running inside the WSo2 cloud.
'''2.1-Libraries'''
'''2.1-Libraries'''
'''2.1.1-Apache Lucene'''
'''2.1.1-Apache Lucene'''
Lucene is an open source full text index and search engine written in Java. The functionalities are mainly used to index PDF  document and calculating similarity between documents.  
Lucene is an open source full text index and search engine written in Java. The functionalities are mainly used to index PDF  document and calculating similarity between documents.  
'''2.1.2-Apache PDFBox'''
'''2.1.2-Apache PDFBox'''
The Apache PDFBox library is an open source Java tool for working with PDF documents. It is used for manipulate PDF documents and  to extract content from PDF documents.  
The Apache PDFBox library is an open source Java tool for working with PDF documents. It is used for manipulate PDF documents and  to extract content from PDF documents.  
'''2.1.3-Stanford Parser'''
'''2.1.3-Stanford Parser'''
Set of open source probabilistic natural language parsers written in Java. It can be used for split sentences, extract named entities, get grammatical structure of sentences. Extracting sentences is the main usage of Stanford Parsee in this context.
Set of open source probabilistic natural language parsers written in Java. It can be used for split sentences, extract named entities, get grammatical structure of sentences. Extracting sentences is the main usage of Stanford Parsee in this context.
'''2.2- Processes'''
'''2.2- Processes'''
'''2.2.1-Document Processor'''
'''2.2.1-Document Processor'''
Controlling of text extraction form PDF documents
Controlling of text extraction form PDF documents
Processing of text and extraction of sentences
Processing of text and extraction of sentences
'''2.2.2-Doc_Sim Calculator'''
'''2.2.2-Doc_Sim Calculator'''
Calculation of similarity between financial documents and FIBO definitions using Okapi Similarity
Calculation of similarity between financial documents and FIBO definitions using Okapi Similarity
Recommending FIBO terms for documents based on document similarity values.
Recommending FIBO terms for documents based on document similarity values.
'''3-Data Service Server (Data tier)'''
'''3-Data Service Server (Data tier)'''
Data service server housed inside the WSo2 cloud and having separate server improves scalability and performance. A SQL database has been used for data storage.
Data service server housed inside the WSo2 cloud and having separate server improves scalability and performance. A SQL database has been used for data storage.
Financial Industry Business Ontology is staged in a database table.
Financial Industry Business Ontology is staged in a database table.
Various financial documents are stored as blob files.
Various financial documents are stored as blob files.

Revision as of 06:49, 28 September 2012

1- Client Web Browser-(Presentation tier) The presentation layer provides the application’s user interface (UI). The Graphical user interfaces are designed with HTML,CSS, JSP, JavaScript, JQuery technologies for smart and easy user interaction. 2-WSo2 Application Server (Logic tier) This layer controls the application’s functionality by performing detailed processing and all the processes are running inside the WSo2 cloud. 2.1-Libraries 2.1.1-Apache Lucene Lucene is an open source full text index and search engine written in Java. The functionalities are mainly used to index PDF document and calculating similarity between documents. 2.1.2-Apache PDFBox The Apache PDFBox library is an open source Java tool for working with PDF documents. It is used for manipulate PDF documents and to extract content from PDF documents. 2.1.3-Stanford Parser Set of open source probabilistic natural language parsers written in Java. It can be used for split sentences, extract named entities, get grammatical structure of sentences. Extracting sentences is the main usage of Stanford Parsee in this context. 2.2- Processes 2.2.1-Document Processor Controlling of text extraction form PDF documents Processing of text and extraction of sentences 2.2.2-Doc_Sim Calculator Calculation of similarity between financial documents and FIBO definitions using Okapi Similarity Recommending FIBO terms for documents based on document similarity values. 3-Data Service Server (Data tier) Data service server housed inside the WSo2 cloud and having separate server improves scalability and performance. A SQL database has been used for data storage. Financial Industry Business Ontology is staged in a database table. Various financial documents are stored as blob files.

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current09:48, 31 October 2012Thumbnail for version as of 09:48, 31 October 2012626 × 811 (77 KB)Kasunp@opensource.lk (talk | contribs)Reverted to version as of 09:45, 31 October 2012
09:46, 31 October 2012Thumbnail for version as of 09:46, 31 October 2012626 × 811 (77 KB)Kasunp@opensource.lk (talk | contribs)Reverted to version as of 09:44, 31 October 2012
09:45, 31 October 2012Thumbnail for version as of 09:45, 31 October 2012626 × 811 (77 KB)Kasunp@opensource.lk (talk | contribs)User Friendly colors
09:44, 31 October 2012Thumbnail for version as of 09:44, 31 October 2012626 × 811 (77 KB)Kasunp@opensource.lk (talk | contribs)Made colors more userfriendly
06:37, 28 September 2012Thumbnail for version as of 06:37, 28 September 2012626 × 811 (84 KB)Kasunp@opensource.lk (talk | contribs)1- Client Web Browser-(Presentation tier) The presentation layer provides the application’s user interface (UI). The Graphical user interfaces are designed with HTML,CSS, JSP, JavaScript, JQuery technologies for smart and easy user interaction. 2-WSo

The following page uses this file: