Published in Towards Data Science·2 days agoMember-onlyHow to Speed Up Training for Large Language ModelsExplore the concept of sequence parallelism and selective activation re-computation — With the advancement of artificial intelligence, it is now possible to train large language models for natural language processing tasks. Typically, a large language model contains more than 100 billion parameters and is trained using advanced algorithms on a large corpus. Large language models (LLM) are very effective and able…NLP6 min read
Published in Towards Data Science·Jul 28Member-onlyMultilingual Text Similarity Matching using EmbeddingUsing sentence-transformer for symmetric semantic search — The topic for today is about calculating the similarity score between two sentences of the same or different languages. We will be utilizing the sentence-transformer framework which comes with its own pre-trained multilingual transformer models. We can make use of these models to compute the text embedding for more than…NLP7 min read
Published in Better Programming·Jul 25Member-onlyHow to Save and Move Docker ImagesMoving doesn’t always have to be hard — After reading this piece, you will learn to package your Docker images and deploy them on other machines locally. The process for copying a Docker image can be confusing for most developers as Docker comes with different commands for saving/loading and exporting/importing. …Programming5 min read
Published in Better Programming·May 27Member-onlyConvert Text to Phoneme in PythonSimple phonemization of words and texts in many languages — By reading this piece, you will learn how to convert an input text string to its corresponding phonemes in Python. A phoneme represents the smallest unit of sound in a language. For example, the word tab consists of three phonemes: /t/ /a/ /b/ The element b is distinguishable when compared…Programming5 min read
Published in Towards Data Science·May 19Member-only20 Open-Source Single Speaker Speech DatasetsA comprehensive open-source multi-lingual speech data — Speech synthesis, also known as text-to-speech (TTS) is one of the new key technologies in the artificial intelligence domain. It provides the capabilities to generate human-like voices from text input dynamically. TTS can be applied in a variety of purposes and tied closely with automation services. However, training a text-to-speech…Text To Speech8 min read
Published in Towards Data Science·Apr 28Member-onlyConvert PASCAL VOC XML to YOLO for Object DetectionTips and tricks to preprocess image datasets — This tutorial covers the following step-by-step guides: convert XML annotations to YOLO annotations visualize the bounding boxes in image using the newly created YOLO annotations split the datasets into train, validation and test sets Overview PASCAL VOC XML The PASCAL Visual Object Classes (VOC) project is one of the earliest computer vision project that…Object Detection9 min read
Published in Better Programming·Apr 15Member-onlyData Augmentation With AugLyAll-in-one augmentation packages for machine learning — In the world of machine learning, data augmentation is one of the most useful techniques to enhance the performance of ML models. Data augmentation serves to create synthetic data via slight modification or transformation to the existing data. This helps to: increase the amount of training and test data. reduce…Programming4 min read
Published in Level Up Coding·Mar 8Member-onlyHow to Remove Personally Identifiable Information (PII) from Audio and VideosRemoving Personally Identifiable Information (PII) from transcription text In my previous article, I have covered How to Transcribe Audio Files to Text. In this tutorial, let’s explore a little further on how to remove Personally Identifiable Information (PII) from the transcription. …Python6 min read
Published in Level Up Coding·Mar 2Member-onlyProfanity Filtering in SpeechReplace offensive words with asterisks Previously, I have covered a tutorial on Speech Content Safety Detection, which identifies sensitive content such as pornography, terrorism, and hate speech in speech. …Python5 min read
Published in Level Up Coding·Feb 21Member-onlyHow to Detect Topics in SpeechIdentify relevant topics based on IAB Taxonomy Topic detection is a technique to discover the abstract topic behind a collection of documents. It is mostly part of the natural language processing technique to classify text into specific topics/domains. Although there are no rules and regulations on how topics should be…Python5 min read