Mike Brummett's picture

Mike Brummett PRO

GoDjMike

AI & ML interests

Edge detection, road anomaly identification, story-generation libraries

Recent Activity

Organizations

None yet

GoDjMike's activity

reacted to nicolay-r's post with šŸš€ 2 days ago
view post
Post
3676
šŸ“¢ If you're looking for translating massive dataset of JSON-lines / CSV data with various set of source fields, then the following update would be relevant. So far and experimenting with adapting language specific Sentiment Analysis model, got a change to reforge and relaese bulk-translate 0.25.2.
ā­ļø https://github.com/nicolay-r/bulk-translate/releases/tag/0.25.2

The update has the following major features
- Supporting schemas: all the columns to be translated are now could be declared within the same prompt-style format. using json this automatically allows to map them onto output fields
- The related updates for shell execution mode: schema parameter is now available alongside with just a prompt usage before.

Benefit is that your output is invariant. You can extend and stack various translators with separated shell laucnhes.

Screenshot below is the application of the google-translate engine in manual batching mode.
šŸš€ Performance: 2.5 it / sec (in the case of a single field translation)

šŸŒŸ about bulk-translate: https://github.com/nicolay-r/bulk-translate
šŸŒŒ nlp-thirdgate: https://github.com/nicolay-r/nlp-thirdgate?tab=readme-ov-file
  • 1 reply
Ā·
reacted to clem's post with šŸ”„ 30 days ago
view post
Post
7187
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!