Conditional random fields in text segmentation by language

Authors

DOI:

https://doi.org/10.18046/syt.v15i43.2712

Keywords:

Text segmentation by language, conditional random fields.

Abstract

This work presents using conditional random fields for solving the task of text segmentation by language, considering it as a sequence tagging task. Language changes are considered to occur in every part of the text, observations are assumed to be the words in the text, and the states are the different languages. Research let conclude that conditional random fields are a powerful tool for segmentation of multilingual text. 

Author Biography

  • Robin Cabeza Ruiz, University of Holguín

    Master in Design Assisted by Computer from the Universidad de Holguín (Cuba, 2015) with a bachelor’s degree in Computer Science from Universidad de Oriente (Cuba, 2017). Currently he is professor of informatics II and member of CAD/CAM Studies Center at the Faculty of Engineering at the Universidad de Holguín. His main areas of interest in research are biomechanical and text segmentation by computer. 

Downloads

Published

2017-12-06

Issue

Section

Discussion papers