मराठी भाषा वाचविण्याचा एक प्रयास. इंग्रजी ते मराठीचा शब्दकोश. An attempt to preserve the Marathi language. A lightweight and ad free English to Marathi thesaurus.

Overview

For English, scroll down

मराठी शब्द

मराठी भाषा वाचवण्यासाठी मी हा ओपन सोर्स प्रोजेक्ट सुरू केला आहे.

माझ्या मते, आपली भाषा हळूहळू आणि कोणाचाही लक्षात न येता एका मृत भाषेच्या दिशेने वाटचाल करत आहे. या उपक्रमात सगळ्यांचे स्वागत आहे, ज्यांना कोणाला हा एक गंभीर विषय वाटतो व त्यात काही सुधारणा करण्याची गरज आहे असे वाटते.

अगदी सोप्या रीतीने सांगायचं झाला तर खालील उदाहरण पहा -

१. मराठी वाक्यांमधील इंग्रजी शब्दांचा जास्त आणि अनावश्यक वापर.

  • अयोग्य - "फार bore झालंय. चला एखादा picture बघूया."
  • योग्य - "फार कंटाळा आलाय. चला एखादा चित्रपट बघूया. "

२. देवनगरीऐवजी लॅटिन अक्षरे वापरुन मराठी टायपिंग / लिहिणे

  • अयोग्य - "me tujhya sobat marathi bolat ahe."
  • योग्य - "मी तुझ्या सोबत मराठीत बोलत आहे."

अधिक माहितीसाठी खालील इंग्रजी मजकूर वाचा. आपण सॉफ्टवेअर अभियंते जरी नसाल तरीही आपण योगदान करू शकता.

योगदान करण्यासाठी

१. "Github" वर आपले खाते बनवा

२. "Discussions" पृष्ठावरील आपल्या कल्पना, टिप्पण्या इ. वर चर्चा करा.

Marathi shabd

About

This project is being developed as a part of an effort to help save the Marathi language from its gradual and unnoticeable decline into a dying language.

Goal


(This is the goal of the overall idea and not just this project.)

Revive the usage of Marathi language in its original/unadulterated form in day-to-day life in both spoken and written medium.

How to do it?


  1. Make people realise that these problems exist
  2. Motivate them to work towards fixing it
  3. Provide them with resources (this project basically is a part of this step)
  4. Ask them to do actually implement this in their daily life

This will be done with a combination of videos, blogs and software tools such as this. (Contributions in all these are welcome.)

Overview of this project

The idea is to have a static website (ad free, bloat free and fast) where people, looking to improve their Marathi vocabulary, can search for an English word/phrase and quickly find its Marathi equivalent, and also usage example wherever possible.

Words can also be categorised into various topics (tags) so that words used in same context can be found together to improve the vocabulary those particular topics. More features can be added in the future, if necessary.

So basically it will be an ad-free and fast English-to-Marathi thesaurus for day-to-day words with some additonal features.

Development and contribution

It is currently in its very initial stage where I am conceptualising it and looking for contributors (developers as well as people well versed in the Marathi language).

Some places to do contributions

  • Database update - adding English words with Marathi equivalents
  • Static website creation - Basically parsing the database and creating an output markdown file with all the content. This file will be used on the github.io static website page.
    • note - I would particularly like help in this area as it is new to me as well.
  • Adding/correcting content in Marathi language to this project's documentation (readme, website pages etc.)

(This is the current plan and can be improvised.)

Please suggest your ideas, comments etc. in the "Discussions" page.

I also have in mind quite a few other ideas related to creating resources in Marathi language, which I plan to start once I have this project's website first ready at some usable level.

What is the need to do this?

As I see it, there 2 main problems which are explained below -

  1. Excessive use of English words in Marathi sentences.

Simply stated this is using a lot of English words in our sentences where we could easily use Marathi words. Example -

  • Not OK - "फार bore झालंय. चला एखादा picture बघूया."
  • OK - "फार कंटाळा आलाय. चला एखादा चित्रपट बघूया. "

The direct consequence of this is that we are loosing our grip on the Marathi vocabulary. And this problem is ever growing like a snowball, which needs external force and motivation to fix it. This problem exists in both the spoken as well as the written form. Also while this is particularly serious in the urban population, it may also expand to rural areas as the reach of English schools and the internet widens.

This project currently is for working on the above problem only.

  1. Typing/writing Marathi using the Latin alphabet instead of Devanagari.

This is basically typing Marathi like this

  • Not OK - "me tujhya sobat marathit bolat ahe."
  • OK - "मी तुझ्या सोबत मराठीत बोलत आहे."

This problem is something that I feel should not exist in today's date, as we now have good keyboards for typing in Marathi using Devanagari on all platforms be it mobile or computers. However it continues to exist, as people find it easier to type using Latin alphabet on the qwerty keyboard.

Owner
मुक्त स्त्रोत
मुक्त स्त्रोत
Convolutional 2D Knowledge Graph Embeddings resources

ConvE Convolutional 2D Knowledge Graph Embeddings resources. Paper: Convolutional 2D Knowledge Graph Embeddings Used in the paper, but do not use thes

Tim Dettmers 586 Dec 24, 2022
Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration

Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration This is the official repository for the EMNLP 2021 long pa

70 Dec 11, 2022
Test finetuning of XLSR (multilingual wav2vec 2.0) for other speech classification tasks

wav2vec_finetune Test finetuning of XLSR (multilingual wav2vec 2.0) for other speech classification tasks Initial test: gender recognition on this dat

8 Aug 11, 2022
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform This repo try to implement iSTFTNet : Fast

Rishikesh (ऋषिकेश) 126 Jan 02, 2023
Repository for Graph2Pix: A Graph-Based Image to Image Translation Framework

Graph2Pix: A Graph-Based Image to Image Translation Framework Installation Install the dependencies in env.yml $ conda env create -f env.yml $ conda a

18 Nov 17, 2022
Application for shadowing Chinese.

chinese-shadowing Simple APP for shadowing chinese. With this application, it is very easy to record yourself, play the sound recorded and listen to s

Thomas Hirtz 5 Sep 06, 2022
This library is testing the ethics of language models by using natural adversarial texts.

prompt2slip This library is testing the ethics of language models by using natural adversarial texts. This tool allows for short and simple code and v

9 Dec 28, 2021
Big Bird: Transformers for Longer Sequences

BigBird, is a sparse-attention based transformer which extends Transformer based models, such as BERT to much longer sequences. Moreover, BigBird comes along with a theoretical understanding of the c

Google Research 457 Dec 23, 2022
HAIS_2GNN: 3D Visual Grounding with Graph and Attention

HAIS_2GNN: 3D Visual Grounding with Graph and Attention This repository is for the HAIS_2GNN research project. Tao Gu, Yue Chen Introduction The motiv

Yue Chen 1 Nov 26, 2022
Code for the paper "Flexible Generation of Natural Language Deductions"

Code for the paper "Flexible Generation of Natural Language Deductions"

Kaj Bostrom 12 Nov 11, 2022
A Python 3.6+ package to run .many files, where many programs written in many languages may exist in one file.

RunMany Intro | Installation | VSCode Extension | Usage | Syntax | Settings | About A tool to run many programs written in many languages from one fil

6 May 22, 2022
Module for automatic summarization of text documents and HTML pages.

Automatic text summarizer Simple library and command line utility for extracting summary from HTML pages or plain texts. The package also contains sim

Mišo Belica 3k Jan 08, 2023
2021语言与智能技术竞赛:机器阅读理解任务

LICS2021 MRC 1. 项目&任务介绍 本项目基于官方给定的baseline(DuReader-Checklist-BASELINE)进行二次改造,对整个代码框架做了简单的重构,对核心网络结构添加了注释,解耦了数据读取的模块,并添加了阈值确认的功能,一些小的细节也做了改进。 本次任务为202

roar 29 Dec 05, 2022
Smart discord chatbot integrated with Dialogflow

academic-NLP-chatbot Smart discord chatbot integrated with Dialogflow to interact with students naturally and manage different classes in a school. De

Tom Huynh 5 Oct 24, 2022
Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Speech_38_ru_commands Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR Программа умеет распознавать 38 ключевы

Andrey 9 May 05, 2022
English loanwords in the world's languages

Wiktionary as CLDF Content cldf1 and cldf2 contain cldf-conform data sets with a total of 2 377 756 entries about the vocabulary of all 1403 languages

Viktor Martinović 3 Jan 14, 2022
skweak: A software toolkit for weak supervision applied to NLP tasks

Labelled data remains a scarce resource in many practical NLP scenarios. This is especially the case when working with resource-poor languages (or text domains), or when using task-specific labels wi

Norsk Regnesentral (Norwegian Computing Center) 850 Dec 28, 2022
Finetune gpt-2 in google colab

gpt-2-colab finetune gpt-2 in google colab sample result (117M) from retraining on A Tale of Two Cities by Charles Di

212 Jan 02, 2023
A Flask Sentiment Analysis API, with visual implementation

The Sentiment Analysis Api was created using python flask module,it allows users to parse a text or sentence throught the (?text) arguement, then view the sentiment analysis of that sentence. It can

Ifechukwudeni Oweh 10 Jul 17, 2022
HF's ML for Audio study group

Hugging Face Machine Learning for Audio Study Group Welcome to the ML for Audio Study Group. Through a series of presentations, paper reading and disc

Vaibhav Srivastav 110 Jan 01, 2023