Hierarchy

  • any
    • PunctuationTokenizer

Properties

STOPWORD

STOPWORD: object = STOPWORD

Type declaration

  • [key: string]: number

STOPWORD2

STOPWORD2: object = STOPWORD2

Type declaration

  • [key: number]: object
    • [key: string]: number
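The two type declarations above can be written out as TypeScript types. The following is a minimal sketch, not the library's own code: the alias names `StopwordTable` and `StopwordByLength` and the helper `groupByLength` are hypothetical, and the idea that the numeric key of STOPWORD2 is the length of the punctuation string is an assumption that the page does not confirm.

```typescript
// Hypothetical alias for the documented shape of STOPWORD.
type StopwordTable = { [key: string]: number };

// Hypothetical alias for the documented shape of STOPWORD2.
// ASSUMPTION: the numeric key is the length of the punctuation string.
type StopwordByLength = { [key: number]: StopwordTable };

// Illustrative helper (not part of the documented API):
// groups a flat table into one sub-table per key length.
function groupByLength(table: StopwordTable): StopwordByLength {
  const grouped: StopwordByLength = {};
  for (const key of Object.keys(table)) {
    const len = key.length;
    if (!grouped[len]) {
      grouped[len] = {};
    }
    grouped[len][key] = table[key];
  }
  return grouped;
}
```

Under that assumption, a lookup for a candidate of known length only has to consult one sub-table.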

_STOPWORD

_STOPWORD: string | string[] = _STOPWORD

name

name: string = "PunctuationTokenizer"

Methods

matchStopword

  • matchStopword(text: string, cur?: number): IWord[]
  • Matches the punctuation marks contained in the text and returns information about each match.

    Parameters

    • text: string

      The text to search.

    • Optional cur: number

      Start position.

    Returns IWord[]

    Return format: {w: 'URL', c: start position}
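To illustrate the documented signature and return format, here is a minimal sketch of how a punctuation matcher of this shape could work. It is not the actual implementation: the fields of `IWord` are inferred only from the return format above, the contents of `STOPWORD` are made-up sample data, and the longest-match-first scan is an assumed strategy.

```typescript
// Hypothetical shape of IWord, inferred from the documented return format.
interface IWord {
  w: string; // the matched text
  c: number; // start position of the match in the input
}

// Made-up sample table for illustration only (punctuation mark -> number).
const STOPWORD: { [key: string]: number } = {
  ",": 1,
  "!": 1,
  "。": 1,
  "...": 3,
};

// Sketch: scan from `cur` and, at each position, prefer the longest mark.
function matchStopword(text: string, cur: number = 0): IWord[] {
  const maxLen = Math.max(...Object.keys(STOPWORD).map((k) => k.length));
  const ret: IWord[] = [];
  let i = Math.max(0, cur);
  while (i < text.length) {
    let matched = false;
    for (let len = maxLen; len >= 1; len--) {
      const w = text.slice(i, i + len);
      if (w.length === len && Object.prototype.hasOwnProperty.call(STOPWORD, w)) {
        ret.push({ w, c: i });
        i += len; // skip past the matched mark
        matched = true;
        break;
      }
    }
    if (!matched) {
      i++;
    }
  }
  return ret;
}
```

With this sample table, `matchStopword("a,b...c")` yields one entry for `,` at position 1 and one for `...` at position 3; passing `cur = 2` skips the comma.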

split
