Sunpinyin and its future
![Page 2: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/2.jpg)
Who Am I
• Just call me Mike.
• Used to be a student at Zhejiang University, Hangzhou. Now admitted to the University of Toronto.
• Was born in Tianjin. (That’s why I’m here.)
• Has been using Linux since high school. A Fedora user now.
• Has been a Sunpinyin committer since winter 2009.
![Page 3: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/3.jpg)
Pinyin Input Method
• Shows several candidates according to the Pinyin that the user types.
• Lots of commercial and free implementations:
  • Pinyin ABC
  • Microsoft Pinyin
  • Sogou Pinyin
  • QQ Pinyin
![Page 4: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/4.jpg)
Approach
• Dictionary based
  • A dictionary that contains all possible words.
  • Always looks words up in the dictionary upon user input.
  • Adjusts the order of candidates when the user commits a word.
• Pros: Easy to implement.
• Cons: Not intelligent enough when words are combined.
• Implementations:
  • Pinyin ABC (Commercial)
  • Fcitx (GPL)
  • ibus-pinyin (GPL)
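The dictionary-based approach can be sketched in a few lines. This is only a toy illustration, not how any of the listed input methods is actually written; the class name `DictionaryIME`, its methods, and the sample entries are all hypothetical.

```python
from collections import defaultdict


class DictionaryIME:
    """Toy dictionary-based input method: Pinyin string -> ranked candidates."""

    def __init__(self, entries):
        # entries: iterable of (pinyin, word) pairs (hypothetical sample data).
        # The insertion order is used as the initial ranking.
        self.table = defaultdict(list)
        for pinyin, word in entries:
            self.table[pinyin].append(word)

    def candidates(self, pinyin):
        # Every query is a plain dictionary lookup on the user's input.
        return list(self.table.get(pinyin, []))

    def commit(self, pinyin, word):
        # Move the committed word to the front so it is offered first next time.
        words = self.table.get(pinyin, [])
        if word in words:
            words.remove(word)
            words.insert(0, word)


ime = DictionaryIME([("shi", "是"), ("shi", "时"), ("shi", "事")])
first = ime.candidates("shi")   # initial order
ime.commit("shi", "事")          # the user picks the third candidate
second = ime.candidates("shi")  # the committed word now comes first
```

Each word is ranked on its own here, which is why this approach struggles once several words have to be combined into a sentence.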
![Page 5: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/5.jpg)
Approach
• N-Gram based
  • Has a database of conditional probabilities.
  • Tries to find the sentence with the highest probability.
  • Interpolates between the user's commit history and the database.
• Pros: Intelligent!
• Cons: Where can I get the database?
• Implementations:
  • Sunpinyin (LGPL/CDDL)
  • Sogou Pinyin (Commercial)
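A minimal sketch of the N-gram idea, assuming a bi-gram model and a Viterbi-style search over per-syllable candidates. The tables `CANDIDATES` and `BIGRAM`, the `FLOOR` value, and the function `best_sentence` are made up for illustration and are not taken from Sunpinyin.

```python
import math

# Hypothetical toy data: candidates per Pinyin syllable and bi-gram probabilities.
CANDIDATES = {"ni": ["你", "泥"], "hao": ["好", "号"]}
BIGRAM = {
    ("<s>", "你"): 0.6, ("<s>", "泥"): 0.4,
    ("你", "好"): 0.7, ("你", "号"): 0.3,
    ("泥", "好"): 0.2, ("泥", "号"): 0.8,
}
FLOOR = 1e-6  # assumed probability for pairs missing from the table


def best_sentence(syllables):
    """Return the most probable word sequence under the toy bi-gram model."""
    # best[word] = (log probability, path) of the best sequence ending in word.
    best = {"<s>": (0.0, [])}
    for syllable in syllables:
        step = {}
        for word in CANDIDATES[syllable]:
            # Keep only the best way of reaching this word from any previous word.
            step[word] = max(
                (logp + math.log(BIGRAM.get((prev, word), FLOOR)), path + [word])
                for prev, (logp, path) in best.items()
            )
        best = step
    return "".join(max(best.values())[1])


sentence = best_sentence(["ni", "hao"])
```

With these numbers, 你好 scores 0.6 × 0.7 = 0.42 and beats the other three combinations, so the whole sentence is chosen at once rather than word by word.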
![Page 6: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/6.jpg)
Sunpinyin and OpenGram Project
• Sunpinyin is an input method using the N-Gram based approach.
• Free as in freedom. LGPL license.
• It uses tri-grams for the built-in model and bi-grams for user history.
• The OpenGram project aims at creating a tri-gram database for Simplified Chinese.
• Free as in freedom. CC license.
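One way to picture combining a built-in tri-gram model with a user bi-gram history is simple linear interpolation. The slides do not say how Sunpinyin mixes the two, so the formula, the weight `lam`, and the function `interpolated_prob` below are assumptions for illustration only.

```python
def interpolated_prob(word, history, builtin_trigram, user_bigram, lam=0.3):
    """Mix a built-in tri-gram estimate with a user bi-gram estimate.

    history is the pair (u, v) of the two preceding words.
    lam is a hypothetical weight given to the user history.
    """
    u, v = history
    p_builtin = builtin_trigram.get((u, v, word), 0.0)  # P(word | u, v)
    p_user = user_bigram.get((v, word), 0.0)            # P(word | v)
    return lam * p_user + (1.0 - lam) * p_builtin


# Hypothetical tables: the built-in model has no preference,
# but this user has always typed 它 after 爱.
builtin = {("我", "爱", "你"): 0.5, ("我", "爱", "它"): 0.5}
user = {("爱", "它"): 1.0}

p_ni = interpolated_prob("你", ("我", "爱"), builtin, user)
p_ta = interpolated_prob("它", ("我", "爱"), builtin, user)
```

Here the user history breaks the tie in the built-in model: `p_ta` comes out higher than `p_ni`, so 它 would be ranked first for this user.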
![Page 7: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/7.jpg)
Current Progress - Sunpinyin
• Released 2.0.3; we’re working on the 2.1/2.5 release.
• Works on Linux, BSD and OS X platforms, with a native interface.
• Ported to IBus (ibus-sunpinyin) and SCIM (scim-sunpinyin). There is also a standalone version (xsunpinyin).
• Progress on the next release:
  • Multiple best sentences: done
  • Partial sentence: done
  • Plugin support: WIP
• Needs maintainer!
![Page 8: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/8.jpg)
What Do I Need to Know Before Joining Sunpinyin?
• Passion.
• A little C++ knowledge. (Nobody on the team knows C++ completely. :) )
• That’s enough! Plus, maybe:
  • IBus/SCIM API, Xorg API or OS X API.
  • Python API. (Plugin support)
  • Windows API. (Windows port?)
![Page 9: Sunpinyin and it's future](https://reader036.vdocuments.site/reader036/viewer/2022081813/549f00e1ac79594c768b48f5/html5/thumbnails/9.jpg)
Q&A