1
0
Fork 0
CLI tool to search unicode data backed by SQLite3
Go to file
Daniele Tricoli 855d40f0b0 Fix some more formatting 2015-12-12 03:04:58 +01:00
format Add \n between package declaration and imports 2015-12-11 03:43:36 +01:00
sql Complete fixture rows 2015-12-11 03:24:27 +01:00
.gitignore Ignore *.swp and executable 2015-09-07 01:38:49 +02:00
.travis.yml Add travis and coverage support 2015-12-10 23:20:34 +01:00
LICENSE Initial import 2015-09-05 01:39:17 +02:00
README.md Also Capital H is missing 2015-12-12 02:32:16 +01:00
doc.go Fix some more formatting 2015-12-12 03:04:58 +01:00
main.go Enforce custom import path 2015-12-12 02:39:24 +01:00
utils.go Show a progress bar during download 2015-12-08 03:56:07 +01:00

README.md

piken

piken is a CLI tool to search unicode data backed by SQLite3.

The name is in homage to the creators of UTF-8: Rob Pike and Ken Thompson.

Install

% go get eriol.xyz/piken

After installation use piken update to get unicode data from http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.

Usage

% piken -h
NAME:
   piken - unicode search tool backed by SQLite3

USAGE:
   ./piken [global options] command [command options] [arguments...]

VERSION:
   0.1

AUTHOR(S):
   Daniele Tricoli <eriol@mornie.org>

COMMANDS:
   update       Update unicode data
   search, s    Search for unicode
   help, h      Shows a list of commands or help for one command

GLOBAL OPTIONS:
   --help, -h           show help
   --version, -v        print the version

piken exposes SQLite3's full-text search on command line.

NOTE: if you are using zsh, it's useful to disable glob expressions with noglob (using an alias for example) instead of quote searches.

Default fields showed are codepoint and name, but it easy to override this:

% piken search red apple
1F34E -- RED APPLE -- 🍎

piken search --fields Name --fields Category --fields CodePoint red apple
RED APPLE -- So -- 1F34E -- 🍎

The following fields can be used:

  • CodePoint
  • Name
  • Category
  • CanonicalClass
  • BidiClass
  • DecompositionType
  • NumericType
  • NumericDigit
  • NumericValue
  • BidiMirrored
  • Unicode1Name
  • ISOComment
  • SimpleUppercaseMapping
  • SimpleLowercaseMapping
  • SimpleTitlecaseMapping

AND, OR, NOT operators

% piken search heart AND (blue OR green)
1F499 -- BLUE HEART -- 💙
1F49A -- GREEN HEART -- 💚

Note: operators must be uppercase.

* and ^ prefixes

% piken search mass*
1D340 -- TETRAGRAM FOR MASSING -- 𝍀
1F486 -- FACE MASSAGE -- 💆
% piken search ^cat
1F408 -- CAT -- 🐈
1F431 -- CAT FACE -- 🐱
1F639 -- CAT FACE WITH TEARS OF JOY -- 😹
1F63C -- CAT FACE WITH WRY SMILE -- 😼

NEAR operator

% piken search crying face
1F622 -- CRYING FACE -- 😢
1F62D -- LOUDLY CRYING FACE -- 😭
1F63F -- CRYING CAT FACE -- 😿

% piken search crying NEAR/0 face
1F622 -- CRYING FACE -- 😢
1F62D -- LOUDLY CRYING FACE -- 😭
% piken search codepoint:1F602
1F602 -- FACE WITH TEARS OF JOY -- 😂

% piken search doo* codepoint:A*
A585 -- VAI SYLLABLE DOO -- ꖅ
A61B -- VAI SYMBOL DOONG -- ꘛ

% piken search category:Ll | wc -l
1984

% piken search math* frak* cap* NOT bold codepoint:1D50*
1D504 -- MATHEMATICAL FRAKTUR CAPITAL A -- 𝔄
1D505 -- MATHEMATICAL FRAKTUR CAPITAL B -- 𝔅
1D507 -- MATHEMATICAL FRAKTUR CAPITAL D -- 𝔇
1D508 -- MATHEMATICAL FRAKTUR CAPITAL E -- 𝔈
1D509 -- MATHEMATICAL FRAKTUR CAPITAL F -- 𝔉
1D50A -- MATHEMATICAL FRAKTUR CAPITAL G -- 𝔊
1D50D -- MATHEMATICAL FRAKTUR CAPITAL J -- 𝔍
1D50E -- MATHEMATICAL FRAKTUR CAPITAL K -- 𝔎
1D50F -- MATHEMATICAL FRAKTUR CAPITAL L -- 𝔏

# Note: Capital C and Capital H are missing from UnicodeData.txt