close
Jump to content

Semantic analysis (machine learning)

From Wikipedia, the free encyclopedia

In machine learning, semantic analysis of a text corpus is the task of building structures that approximate concepts from a large set of documents. It generally does not involve prior semantic understanding of the documents.

Semantic analysis strategies include:

Stochastic semantic analysis

[edit]

Stochastic semantic analysis is an approach used in computer science as a semantic component of natural language understanding.

Stochastic models generally use the definition of segments of words as basic semantic units for the semantic models, and in some cases involve a two layered approach.[3]

Example applications have a wide range. In machine translation, it has been applied to the translation of spontaneous conversational speech among different languages.[4] In the area of spoken language understanding the fact that spoken sentences often do not follow the grammar of a language and involve self-corrections, repetitions, and other irregularities, the use of stochastic semantic has been suggested as a natural fit to achieve robustness to deal with noise due to the spontaneous nature of spoken language.[5]

See also

[edit]

References

[edit]
  1. ^ Nitin Indurkhya; Fred J. Damerau (22 February 2010). Handbook of Natural Language Processing. CRC Press. ISBN 978-1-4200-8593-8.
  2. ^ Michael Spranger (15 June 2016). The evolution of grounded spatial language. Language Science Press. ISBN 978-3-946234-14-2.
  3. ^ Language Understanding Using Two-Level Stochastic Models by F. Pla, et al, 2001, Springer Lecture Notes in Computer Science ISBN 978-3-540-42557-1
  4. ^ W. Minkera, M. Gavaldàb and A. Waibel Stochastically-based semantic analysis for machine translation in Computer Speech & Language Volume 13, Issue 2, April 1999, Pages 177-194
  5. ^ R. De Mori et al, Spoken language understanding in IEEE Signal Processing Magazine, May 2008 Volume: 25 Issue: 3, pages 50 - 58 ISSN 1053-5888