Collocation Extractor
Colloc
http://nlptools.racai.ro/
Colloc
ID:
Colloc
A stand-alone application that identifies and extracts collocations, along with their contexts of occurrence, from a given preprocessed text. The tool is language independent.
Distribution
Availability
Available - Restricted Use
Licence
MS Commons BY-NC-SA
Restrictions:
Inform Licensor, No Redistribution
User Nature:
Academic, Commercial
Distribution Access/Medium:
Accessible Through Interface
Attribution Details:
Please cite this paper: 'Dan Ștefănescu. Intelligent Information Mining from Multilingual Corpora. PhD thesis (in Romanian). Romanian Academy, Bucharest, Romania'
Licensors:
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
http://www.racai.ro
NLP Group
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
Casa Academiei, Calea 13 Septembrie no. 13, floor 3, room 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
IPR Holder
Dan Ștefănescu
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
Principal Researcher 3
Casa Academiei, Calea 13 Septembrie no. 13, floor 3, room 3318
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
Contact Person
Dan Tufiș
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
Director
Casa Academiei, Calea 13 Septembrie no. 13, floor 3, room 3310
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
toolService
Tool
Language Independent
Input
Media type:
Text
Resource type:
Corpus
Modality:
Written Language
Annotation type:
Lemmatization, Morphosyntactic Annotation (POS Tagging), Segmentation
Segmentation level:
Word
Output
Media type:
Text
Resource type:
Corpus
Modality:
Written Language
Character encoding:
UTF-8
Annotation format:
Text output with one collocation per line; annotations are separated by tabs
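The output layout described above (one collocation per line, tab-separated annotations, UTF-8) can be read with a few lines of Python. This is only a minimal sketch: the record does not document which annotations appear or in what order, so the function name `read_collocations` and the sample rows are hypothetical, and each line is simply returned as a list of fields.

```python
import io
from typing import Iterator, List, TextIO


def read_collocations(stream: TextIO) -> Iterator[List[str]]:
    """Yield one list of tab-separated fields per non-empty line."""
    for line in stream:
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        yield line.split("\t")


# Hypothetical sample data: the real column layout is not documented here.
sample = io.StringIO(
    "strong tea\tannotation-1\tannotation-2\n"
    "heavy rain\tannotation-1\n"
)
rows = list(read_collocations(sample))
```

For a real output file, the same function can be given a handle opened with `open(path, encoding="utf-8")`; using `encoding="utf-8-sig"` instead also tolerates a byte order mark, which may or may not be present in the tool's output.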
Operation
Operating system:
Windows
Required hardware:
Other
Running environment details:
Requires Microsoft .NET Framework 3.5 and 2 GB of RAM
Running time:
Depends on the size of the input file: about 5 minutes for any operation on a 350 MB input file
Metadata
Created:
07/12/2012
Last Updated:
02/01/2013
Metadata Language:
English
Metadata Creator
Dan Ștefănescu
http://www.racai.ro/...
"Mihai Drăgănescu" Research Institute for Artificial Intelligence of the Romanian Academy
RACAI
Principal Researcher 3
Casa Academiei, Calea 13 Septembrie no. 13, floor 3, room 3318
050711 București
România
Tel.: 0040 21 3188103
Fax: 0040 21 3188142
Version
Version:
1.5
Documentation
Document Type:
Manual
Dan Ștefănescu, Collocation Extractor, http://ws.racai.ro:9...
Keywords:
collocation extractor, application
Document Language:
English