LanguageIdentifier

Value Members

final def !=(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def !=(arg0: Any): Boolean

Definition Classes
Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def ==(arg0: Any): Boolean

Definition Classes
Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
def classify(text: String, threshold: Double, minTextSize: Int): Option[(String, Double)]

Classify an indivitual text for the language of the text
Classify an indivitual text for the language of the text
text
the text to classify
threshold
the minimum score threshold to consider a valid prediction
minTextSize
the minimum length for the text to make a prediction
returns
a pair of code (ISO-639-1 langauge code) and prediction score if a prediction could be made, otherwise None

Definition Classes
LanguageIdentifier → LanguageIdentifier
def classify(text: String): Option[(String, Double)]

Classify an indivitual text for the language of the text
Classify an indivitual text for the language of the text
This method should use sensible defaults for the threshold and minTextScore parameters
returns
a pair of code (ISO-639-1 langauge code) and prediction score if a prediction could be made, otherwise None

Definition Classes
LanguageIdentifier
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
lazy val defaultFrequency: Double
lazy val defaultMinTextSize: Int
lazy val defaultThreshold: Double
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
val model: Model
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def summarize(texts: TraversableOnce[String], threshold: Double, frequency: Double, minTextSize: Int): Vector[(String, Double, Double)]

Given a list of strings, returns an ordered list of unique identified languages using ISO-639-1 langauge code.
Given a list of strings, returns an ordered list of unique identified languages using ISO-639-1 langauge code.
This is built to classify many text entries, with the assumption that we only neeed to knowabout the most common langagues in the texts.
Change threshold and frequency to deal with outlier data. Increasing threshold increases the confidence of identified languages, while increasing frequency reduces impact of minor second language usage.
texts
the texts to classify and summarize
threshold
the
returns
Vector of 3-tuples (lang-code, avg-lang-classification-score, frequency)

Definition Classes
LanguageIdentifier → LanguageIdentifier
def summarize(texts: TraversableOnce[String]): Vector[(String, Double, Double)]

Given a list of strings, returns an ordered list of unique identified languages using ISO-639-1 langauge code.
Given a list of strings, returns an ordered list of unique identified languages using ISO-639-1 langauge code.
This is built to classify many text entries, with the assumption that we only neeed to knowabout the most common langagues in the texts.
Change threshold and frequency to deal with outlier data. Increasing threshold increases the confidence of identified languages, while increasing frequency reduces impact of minor second language usage.
returns
Vector of 3-tuples (lang-code, avg-lang-classification-score, frequency)

Definition Classes
LanguageIdentifier
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

object LanguageIdentifier extends LanguageIdentifier

Value Members

final def !=(arg0: AnyRef): Boolean

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: AnyRef): Boolean

final def ==(arg0: Any): Boolean

final def asInstanceOf[T0]: T0

def classify(text: String, threshold: Double, minTextSize: Int): Option[(String, Double)]

def classify(text: String): Option[(String, Double)]

def clone(): AnyRef

lazy val defaultFrequency: Double

lazy val defaultMinTextSize: Int

lazy val defaultThreshold: Double

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

def hashCode(): Int

final def isInstanceOf[T0]: Boolean

val model: Model

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

def summarize(texts: TraversableOnce[String], threshold: Double, frequency: Double, minTextSize: Int): Vector[(String, Double, Double)]

def summarize(texts: TraversableOnce[String]): Vector[(String, Double, Double)]

final def synchronized[T0](arg0: ⇒ T0): T0

def toString(): String

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from LanguageIdentifier

Inherited from AnyRef

Inherited from Any

Ungrouped