If you work on Information Retrieval and need text pre-processing techniques for LLMs, this post might be relevant.
Excited to share the release of version 0.25.1 of the AREkit library!
AREkit is an NLP toolkit of components for deep understanding of textual narratives through the extraction of inner relations via various techniques, including machine learning. It is helpful if you wish to structure your dataset for an IR problem: it turns your narratives into structured datasets of the relations mentioned in sentences (sampling).
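To make "sampling" concrete, here is a minimal sketch in plain Python. It does not use the AREkit API: the function name `sample_pairs`, the input layout (a sentence paired with its already-extracted entity mentions), and the row fields are all assumptions made for illustration. It shows one plausible reading of the idea, where every ordered pair of entities mentioned in the same sentence becomes one row of a structured dataset.

```python
from itertools import permutations


def sample_pairs(sentences):
    """Yield one row per ordered pair of entities found in the same sentence.

    `sentences` is an iterable of (text, entities) tuples; this layout is a
    hypothetical stand-in, not the AREkit data model.
    """
    for sent_id, (text, entities) in enumerate(sentences):
        for source, target in permutations(entities, 2):
            yield {
                "sent_id": sent_id,
                "source": source,
                "target": target,
                "text": text,
            }


sentences = [
    ("Alice praised Bob.", ["Alice", "Bob"]),
    ("Carol left.", ["Carol"]),  # a single entity produces no pairs
]
rows = list(sample_pairs(sentences))
for row in rows:
    print(row)
```

Each row can then be labeled or passed to a classifier or an LLM of your choice, which is the kind of downstream step the toolkit is meant to feed.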
In the GenAI era, AREkit contributes no-string NLP pipelines and related elements for building your own NLP workflow with any third-party ML model, LLM, or API you wish.
https://github.com/nicolay-r/AREkit/releases/tag/v0.25.1-rc
Version 0.25.1 takes the following steps in that direction:
1. Native batching support for pipelines
2. Spun off third-party projects for several text-preprocessing elements:
bulk-translate for translation with GoogleTranslate or any other backend you wish: https://github.com/nicolay-r/bulk-translate
bulk-ner for NER with DeepPavlov models or any other models you wish: https://github.com/nicolay-r/bulk-ner
bulk-chain for reasoning with any LLM you wish: https://github.com/nicolay-r/bulk-chain
* (AREkit support coming soon)
3. Removed components related to conventional neural networks
One of the demos is ARElight, a granular viewer / GUI for a network-based representation of information extracted from narratives:
ARElight: https://github.com/nicolay-r/ARElight
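As a rough illustration of what batching in a pipeline means (item 1 above), here is a minimal sketch in plain Python. It is not the AREkit implementation: `iter_batches`, `run_pipeline`, and the two toy stages are hypothetical names chosen for this example. The point is that each stage receives a whole list of items at once, which is what lets a translation, NER, or LLM backend process several texts per call.

```python
from itertools import islice


def iter_batches(items, batch_size):
    """Split any iterable into consecutive lists of at most `batch_size` items."""
    it = iter(items)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def run_pipeline(items, stages, batch_size):
    """Pass each batch through every stage in order and yield the results."""
    for batch in iter_batches(items, batch_size):
        for stage in stages:
            batch = stage(batch)  # each stage maps a list to a list
        yield from batch


# Toy stages standing in for real pre-processing steps.
stages = [
    lambda batch: [s.strip() for s in batch],
    lambda batch: [s.lower() for s in batch],
]

result = list(run_pipeline(["  Hello ", "World  ", " AREkit"], stages, batch_size=2))
print(result)
```

With `batch_size=2` the three inputs are processed as one batch of two and one batch of one; the output order matches the input order.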