Python normalize_text Examples

Programming Language: Python

Namespace/Package Name: thunder.text_processing.preprocess

Method/Function: normalize_text

Examples at hotexamples.com: 4

Python normalize_text - 4 examples found. These are the top rated real world Python examples of thunder.text_processing.preprocess.normalize_text extracted from open source projects. You can rate examples to help us improve the quality of examples.

Example #1

Show file

def test_normalize_text(inp):
    normalized = normalize_text(inp)
    if hasattr(normalized, "isascii"):
        # Only exists on python 3.7+
        assert normalized.isascii()
    # this will raise an exception if the text is not normalized
    normalized.encode("ascii")

Example #2

Show file

def sample_manifest(sample_data):
    audio_files = get_files(sample_data / "LapsBM-F004", ".wav")

    manifest = sample_data / "test_example_manifest.json"
    with open(manifest, "w", encoding="utf8") as f:
        for fil in audio_files:
            data = {
                "audio_filepath":
                str(fil.resolve()),
                "duration":
                audio_len(fil),
                "text":
                normalize_text(fil.with_suffix(".txt").read_text().strip()),
            }
            json.dump(data, f)
            f.write("\n")
    return manifest

Example #3

Show file

def test_normalize_text_specific_inputs():
    assert normalize_text("áàâã") == "aaaa"
    assert normalize_text("ç") == "c"

Example #4

Show file

File: dataset.py Project: scart97/thunder-speech

 def preprocess_text(self, text: str) -> str:
     normalized = normalize_text(text)
     lower = lower_text(normalized)
     return lower