https://github.com/microsoft/Recognizers-Text Skip to content Toggle navigation Sign up * Product + Actions Automate any workflow + Packages Host and manage packages + Security Find and fix vulnerabilities + Codespaces Instant dev environments + Copilot Write better code with AI + Code review Manage code changes + Issues Plan and track work + Discussions Collaborate outside of code + Explore + All features + Documentation + GitHub Skills + Blog * Solutions + For + Enterprise + Teams + Startups + Education + By Solution + CI/CD & Automation + DevOps + DevSecOps + Case Studies + Customer Stories + Resources * Open Source + GitHub Sponsors Fund open source developers + The ReadME Project GitHub community articles + Repositories + Topics + Trending + Collections * Pricing [ ] * # In this repository All GitHub | Jump to | * No suggested jump to results * # In this repository All GitHub | Jump to | * # In this organization All GitHub | Jump to | * # In this repository All GitHub | Jump to | Sign in Sign up {{ message }} microsoft / Recognizers-Text Public * Notifications * Fork 414 * Star 1.4k Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, and date/time expressed in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Contributions are greatly welcome! Packages are available at https://www.nuget.org/profiles/Recognizers.Text and https:// www.npmjs.com/~recog... License MIT license 1.4k stars 414 forks Star Notifications * Code * Issues 141 * Pull requests 24 * Discussions * Actions * Projects 3 * Wiki * Security * Insights More * Code * Issues * Pull requests * Discussions * Actions * Projects * Wiki * Security * Insights microsoft/Recognizers-Text This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. master Switch branches/tags [ ] Branches Tags Could not load branches Nothing to show {{ refName }} default View all branches Could not load tags Nothing to show {{ refName }} default View all tags Name already in use A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch? Cancel Create 6 branches 58 tags Code * Local * Codespaces * Clone HTTPS GitHub CLI [https://github.com/m] Use Git or checkout with SVN using the web URL. [gh repo clone micros] Work fast with our official CLI. Learn more. * Open with GitHub Desktop * Download ZIP Sign In Required Please sign in to use Codespaces. Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Launching Xcode If nothing happens, download Xcode and try again. Launching Visual Studio Code Your codespace will open once ready. There was a problem preparing your codespace, please try again. Latest commit @Neelisha-saxena Neelisha-saxena [EN DateTimeV2] Extended recurrence support in extractor (#3009) ... 1d910cb Dec 22, 2022 [EN DateTimeV2] Extended recurrence support in extractor (#3009) 1d910cb Git stats * 2,017 commits Files Permalink Failed to load latest commit information. Type Name Latest commit message Commit time .NET .github/ISSUE_TEMPLATE Java JavaScript Patterns Python Script Specs Tools .gitignore CODE_OF_CONDUCT.md CONTRIBUTING.md LICENSE README.md SECURITY.md appveyor.yml build.cmd View code Microsoft Recognizers Text Overview Utilizing the Project Citing the Recognizers-Text project Help Contributing Supported Entities across Cultures README.md Microsoft Recognizers Text Overview Build Status Build Status Microsoft.Recognizers.Text provides robust recognition and resolution of entities like numbers, units, and date/time; expressed in multiple languages. Full support for Chinese, English, French, Spanish, Portuguese, German, Italian, Turkish, Hindi, and Dutch. Partial support for Japanese, Korean, Arabic, and Swedish. More on the way. Utilizing the Project Microsoft.Recognizers.Text powers pre-built entities in LUIS: Language Understanding Intelligent Service, Power Virtual Agents, and Microsoft Bot Framework; base entity types in Text Analytics Cognitive Service; and it is also available as standalone packages (for the base classes and the different entity recognizers). The Microsoft.Recognizers.Text packages currently target four platforms: * C#/.NET - NuGet packages available at: https://www.nuget.org/ profiles/Recognizers.Text * JavaScript/TypeScript - NPM packages available at: https:// www.npmjs.com/~recognizers.text * Python - PyPI packages available at: https://pypi.org/user/ recognizers-text/ (alpha) * Java (in progress) Contributions are greatly welcome! Both for fixes and extensions in the currently supported languages and for expansion to new ones. Especially for Japanese, Korean, Arabic, Swedish, and others! More info below. .NET is the primary package version and contributions propagate to the other platforms with time. Citing the Recognizers-Text project If you utilize the recognizers in academic works, please cite it as below (you can omit the version number or update it to a specific version if relevant): @software{soft:recognizers-text, author = {Wenhao Huang and Zijia Lin and Chris McConnell and B{\"{o}}rje F. Karlsson}, title = {{Recognizers-Text}: {R}ecognition and resolution of numbers, units, and date/time entities expressed across multiple languages}, month = jul, year = 2017, publisher = {Zenodo}, version = {1.0.0}, doi = {10.5281/zenodo.6860598}, url = {https://doi.org/10.5281/zenodo.6860598} } Feel free to change "@software" to "@misc" if it better fits your templates. Help If you have any questions, please go ahead and open an issue, even if it's not an actual bug. Issues are an acceptable discussion forum as well. Contributing This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments. Good starting points for contribution are: * the list of open issues (especially those marked as help wanted); * the json spec cases temporarily marked as NotSupported (Specs); and * translating json test spec cases that work in English, but don't yet exist in a target language. The links below describe the project structure and provide both an overview and tips on how to contribute (although some steps may have become a little out-of-date). Thank you! * Overview and language resources * Implementing language specific behaviour * Test specs and testing in general Supported Entities across Cultures The table below summarizes the currently supported entities. Support for English is usually more complete than others. The primary platform is .NET (shown in table) and support should propagate to the others. Entity Type EN ZH-CN NL FR DE IT JA KO PT ES Number (cardinal) Ordinal Percentage Number Range PA/EO Unit - Age PA/EO Unit - Currency PA/EO Unit - Dimensions PA/EO Unit - Temperature Choice - Boolean SO Seq. - E-mail G G* G G G G G* G* G G Seq. - GUID G G G G G G G G G G Seq. - Social G G G G G G G G G G Seq. - IP Address G G G G G G G G G G Seq. - Phone Number G G G G G G G G G G Seq. - URL G G* G G G G G* G* G G DateTime (+subtypes) SO Entity Type SV BG TR HI AR Number (cardinal) PA/EO Ordinal PA/EO Percentage PA/EO Number Range PA/EO Unit - Age Unit - Currency Unit - Dimensions Unit - Temperature Choice - Boolean Seq. - E-mail G G G G G Seq. - GUID G G G G G Seq. - Social G G G G G Seq. - IP Address G G G G G Seq. - Phone Number Seq. - URL G G G G* G* DateTime (+subtypes) SP SO * G: Generic entity, not language-specific (* unicode TLDs not-supported); * EO: Extraction-only (parsing/resolution/normalization pending); * PA: Partial support (type not fully supported); * SO: Specs-only (test specs coverage OK, but support pending); * SP: Partial specs; * SI: Very initial specs (typically language support start for a new language). About Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, and date/time expressed in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Contributions are greatly welcome! Packages are available at https://www.nuget.org/profiles/Recognizers.Text and https:// www.npmjs.com/~recog... Topics nlp datetime parser-library ner hacktoberfest timex entity-extraction nlp-entity-extraction numex Resources Readme License MIT license Code of conduct Code of conduct Security policy Security policy Stars 1.4k stars Watchers 65 watching Forks 414 forks Releases 57 Release of Recognizers-Text Nuget packages v1.8.6 Latest Dec 6, 2022 + 56 releases Packages 0 No packages published Used by 217 * @Gasmimed19 * @alejus27 * @microsoft * @kriebb * @RenattoB * @RenattoB * @alokahr * @alokahr + 209 Contributors 145 * @tellarin * @JuanAr * @aitelint * @sanxing-chen * @guom08 * @Grey0202 * @songwenhao1 * @aitelintII * @pcostantini * @enzocano * @dependabot[bot] + 134 contributors Languages * C# 41.9% * Java 19.4% * Python 18.8% * JavaScript 9.9% * TypeScript 9.8% * HTML 0.1% * Other 0.1% Footer (c) 2023 GitHub, Inc. Footer navigation * Terms * Privacy * Security * Status * Docs * Contact GitHub * Pricing * API * Training * Blog * About You can't perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.