pytorch/text: Data loaders and abstractions for text and NLP

Stars: 3197
Watchers: 262
Forks: 775
Issues: 304

text Recent Issues

| Issue Title | State | Comments | Created Date | Updated Date | Closed Date |
| --- | --- | --- | --- | --- | --- |
| Add new TorchText logo to README and documentation | open | 0 | 2023-01-26 | 2023-01-30 | - |
| Reduce patches necessary to build for conda-forge | open | 0 | 2023-01-24 | 2023-01-30 | - |
| Make torchtext available on conda-forge | closed | 30 | 2023-01-13 | 2023-01-30 | 2023-01-20 |
| [GPT2BPETokenizer] Out-of-memory issue during training when special token was added through "add_special_tokens()" | closed | 0 | 2023-01-10 | 2023-02-01 | 2023-01-19 |
| Increase memory for integration tests | open | 2 | 2022-12-29 | 2023-01-18 | - |
| Fix lint errors in `validate_binaries.yml` | open | 1 | 2022-12-20 | 2023-01-30 | - |
| Save and loading vocabaluray | open | 1 | 2022-12-13 | 2023-01-25 | - |
| Torchtext nightly version not available for Linux from December 01, 2022 | closed | 1 | 2022-12-09 | 2023-01-18 | 2022-12-13 |
| Change transforms in RoBERTa into classes | open | 0 | 2022-12-07 | 2023-01-18 | - |
| One of the three datasets returned by `Multi30k` seems to be bugged. | closed | 2 | 2022-12-07 | 2023-02-01 | 2022-12-07 |
| Saving label vocab for serving | closed | 0 | 2022-11-30 | 2023-01-18 | 2022-11-30 |
| Cannot run text_classifier end to end | open | 2 | 2022-11-27 | 2023-01-18 | - |
| Torchtext addresses empty memory and fails with OSError | closed | 2 | 2022-11-16 | 2023-01-18 | 2022-11-18 |
| Update readme to correct release versions. | closed | 0 | 2022-11-02 | 2023-01-18 | 2022-11-04 |
| TorchText nightly release is based on PyTorch 1.13.0 official release | closed | 4 | 2022-11-01 | 2023-01-18 | 2022-11-07 |
| 0.14.0: incorrect version tag | closed | 1 | 2022-10-29 | 2023-01-18 | 2022-11-01 |
| Can't import torchtext | closed | 6 | 2022-10-26 | 2023-01-18 | 2022-12-13 |
| Error when importing torchtext: "No module named 'regex'" | closed | 0 | 2022-10-19 | 2023-01-31 | 2022-10-19 |
| Ensure `main` and `fbsync` are both in sync | open | 0 | 2022-10-18 | 2023-02-01 | - |
| Add `LengthSetterIterDataPipe` to all torchtext datasets | open | 2 | 2022-10-13 | 2023-01-18 | - |
| torchtext.transforms does not provide custom tokenization | open | 2 | 2022-10-09 | 2023-01-18 | - |
| Super slow performance | open | 3 | 2022-10-06 | 2023-01-26 | - |
| Failed to import my built torchtext | closed | 8 | 2022-10-05 | 2023-01-28 | 2022-12-16 |
| test_iwslt2016 is flaky and fails periodically on different platforms | closed | 2 | 2022-10-04 | 2023-01-21 | 2022-10-06 |
| Libtorchtext Bert Model | open | 0 | 2022-10-04 | 2023-01-09 | - |
| torchtext.data.Field | closed | 1 | 2022-10-04 | 2023-01-22 | 2022-10-04 |
| Investigate flaky test `test_download_charngram_vectors` on Windows | open | 0 | 2022-09-27 | 2023-01-15 | - |
| IWSLT datasets are not properly unpacked from `tgz` file | closed | 9 | 2022-09-15 | 2022-11-30 | 2022-10-20 |
| cannot import name 'BERTTokenizer' from 'torchtext.transforms' | closed | 2 | 2022-09-14 | 2023-01-30 | 2022-09-15 |
| Investigate `test_vocab_from_raw_text_file` failures on Linux | open | 0 | 2022-09-13 | 2022-11-26 | - |
| Unclear how `torchtext.data.functional.generate_sp_model` handles CSV files | open | 0 | 2022-09-09 | 2023-02-01 | - |
| [CI] Unittests fail on Windows with Python 3.10 | closed | 0 | 2022-09-01 | 2023-01-02 | 2022-09-03 |
| Moving from Google Drive downloads to GitHub Releases | closed | 2 | 2022-09-01 | 2023-01-29 | 2022-09-09 |
| Allowing more control over verbosity of `torchtext.data.functional.generate_sp_model` | open | 3 | 2022-08-31 | 2023-01-06 | - |
| Add `never_split` kwarg to BERTTokenizer to achieve parity with `transformers.BertTokenizer` | closed | 0 | 2022-08-18 | 2022-12-01 | 2022-09-19 |
| Add Example of Machine Translation using mBART model to TorchText | open | 0 | 2022-08-08 | 2023-01-14 | - |
| Error running unit tests when building with setup.py install | closed | 3 | 2022-08-05 | 2023-01-14 | 2022-09-16 |
| The Vocab function in torchtext still have some issues. It's only works with python 3.9.12 torchtext 0.12.0 pytorch 1.11.0 and not compatible with other versions | open | 4 | 2022-07-27 | 2023-01-10 | - |
| Support for example/label slices batching in DataPipe | closed | 0 | 2022-07-22 | 2023-01-15 | 2022-07-22 |
| Installing torchdata results in torchtext breaking | closed | 4 | 2022-07-22 | 2023-01-13 | 2022-08-17 |
| Compatibility error between torch 1.9 and torchtext 0.10 | closed | 0 | 2022-07-21 | 2023-01-12 | 2022-07-21 |
| Scripted Roberta model stuck in second inference call | open | 2 | 2022-07-20 | 2023-01-10 | - |
| Can't pickle local object 'to_map_style_dataset.<locals>._MapStyleDataset' | closed | 9 | 2022-07-19 | 2023-01-24 | 2022-07-21 |
| Off by one (OBO) error in torchtext implementation of BERTTokenizer | closed | 0 | 2022-07-15 | 2023-01-09 | 2022-07-18 |
| Add Nightly releases for macos < 11 with x86_64 | closed | 6 | 2022-07-13 | 2023-01-31 | 2022-08-10 |
| NameError: name 'IterableWrapper' is not defined in WikiText2 | closed | 5 | 2022-07-09 | 2023-01-25 | 2022-10-20 |
| Add T5 Model and Demo on Text Summarization using CNNDM Dataset | closed | 0 | 2022-06-21 | 2023-01-24 | 2022-08-17 |
| Unable to compile torchtext v0.13 | closed | 20 | 2022-06-20 | 2023-01-13 | 2022-07-11 |
| 'MapperIterDataPipe' object is not an iterator --- Error encountered when iterating IMDB Dataset | closed | 7 | 2022-06-20 | 2023-01-23 | 2022-06-20 |
| Segmentation fault on loading a tokenizer with torch.jit.load + map_location | open | 4 | 2022-06-15 | 2023-01-11 | - |