Options
All
  • Public
  • Public/Protected
  • All
Menu

Class SubSModuleTokenizer

Hierarchy

Implements

Index

Constructors

constructor

Properties

Protected Optional _BLACKLIST

_BLACKLIST: IDICT_BLACKLIST

Protected Optional _POSTAG

_POSTAG: POSTAG

Protected Optional _TABLE

_TABLE: any

Optional inited

inited: boolean

name

name: string

Optional priority

priority: number

segment

segment: Segment

type

type: "tokenizer" = "tokenizer"

Static type

type: "tokenizer" = "tokenizer"

Methods

Protected _cache

  • _cache(...argv: any[]): void

Protected _splitUnknow

  • _splitUnknow<T, U>(words: T[], fn: function): U[]
  • 仅对未识别的词进行匹配 包含已存在 但 p 為 0

    Type parameters

    Parameters

    • words: T[]
    • fn: function
        • (text: string, ...argv: any[]): U[]
        • Parameters

          • text: string
          • Rest ...argv: any[]

          Returns U[]

    Returns U[]

Protected _splitUnset

  • _splitUnset<T, U>(words: T[], fn: function): U[]
  • 仅对未识别的词进行匹配 不包含 p 為 0

    Type parameters

    Parameters

    • words: T[]
    • fn: function
        • (text: string, ...argv: any[]): U[]
        • Parameters

          • text: string
          • Rest ...argv: any[]

          Returns U[]

    Returns U[]

Protected createRawToken

Protected createToken

  • createToken<T, U>(data: T, skipCheck?: boolean, attr?: U & IWordDebugInfo): T

Protected debugToken

  • debugToken<T, U>(data: T, attr?: U & IWordDebugInfo, returnToken?: true, ...argv: any[]): T

init

  • init(segment: Segment, ...argv: any[]): this

Protected sliceToken

  • sliceToken<T, U>(words: T[], pos: number, len: number, data: T, skipCheck?: boolean, attr?: U & IWordDebugInfo): T[]

Abstract split

Static Protected _init

Static init

  • init<T>(segment: Segment, ...argv: any[]): T

Generated using TypeDoc