Text Processing (21)

870 • Top 50%

pygments/pygments

Pygments is a generic syntax highlighter written in Python
Python • BSD-2-Clause • Deep bench • 730 contributors • 6 years
488 • Top 50%

pyparsing/pyparsing

Python library for creating PEG parsers
Python • MIT • Star magnet • 2.4k stars
440 • Top 50%

daviddrysdale/python-phonenumbers

Python port of Google's libphonenumber
Python • Apache-2.0 • Star magnet • 3.7k stars
383

andialbrecht/sqlparse

A non-validating SQL parser module for Python
Python • BSD-3-Clause • Star magnet • 3.9k stars
379

mozillazg/python-pinyin

Converts Chinese characters to pinyin (pypinyin)
Python • MIT • Star magnet • 5.2k stars
317

chardet/chardet

Python character encoding detector
Python • LGPL-2.1 • Star magnet • 2.3k stars
307

life4/textdistance

📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
Python • MIT • Star magnet • 3.5k stars
305

dabeaz/ply

Python Lex-Yacc
Python • Star magnet • 2.9k stars
300

rspeer/python-ftfy

Fixes mojibake and other glitches in Unicode text, after the fact.
Python • Star magnet • 4k stars
297

pwaller/pyfiglet

An implementation of figlet written in Python
Python • Star magnet • 1.5k stars
284

skorokithakis/shortuuid

A generator library for concise, unambiguous and URL-safe UUIDs.
Python • BSD-3-Clause • Star magnet • 2.2k stars
281

un33k/python-slugify

Returns Unicode slugs
Python • MIT • Star magnet • 1.6k stars
276

derek73/python-nameparser

A simple Python module for parsing human names into their individual components
Python • Star magnet • 691 stars
273

selwin/python-user-agents

A Python library that provides an easy way to identify devices such as mobile phones and tablets, and their capabilities, by parsing (browser) user agent strings.
Python • MIT • Star magnet • 1.5k stars
258

davidaurelio/hashids-python

Implementation of hashids (http://hashids.org) in Python. Compatible with Python 2 and Python 3.
Python • MIT • Star magnet • 1.4k stars
257

avian2/unidecode

ASCII transliterations of Unicode text - GitHub mirror
Python • GPL-2.0 • Star magnet • 592 stars
256

ztane/python-Levenshtein

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity.
C • GPL-2.0 • Star magnet • 1.3k stars
224

voronind/awesome-slugify

Flexible slugify function for Python
Python • 490 stars
217

mozilla/unicode-slugify

A slugifier that works in Unicode
Python • BSD-3-Clause • 321 stars
205

vinta/pangu.py

Paranoid text spacing in Python
Python • MIT • 268 stars
199

yelsayd/pydiff

A simple GUI for Python's difflib to compare files and directories
Python • MIT • 136 stars