Commit

remove resiDual, as hyperconnections is the culmination for that line of research
lucidrains committed Feb 4, 2025
1 parent 66ab2c7 commit 62237f8
Showing 2 changed files with 56 additions and 76 deletions.
10 changes: 0 additions & 10 deletions README.md
@@ -2015,16 +2015,6 @@ ids_out, num_out, is_number_mask = model.generate(start_ids, start_nums, 17)
}
```

```bibtex
@article{Xie2023ResiDualTW,
title = {ResiDual: Transformer with Dual Residual Connections},
author = {Shufang Xie and Huishuai Zhang and Junliang Guo and Xu Tan and Jiang Bian and Hany Hassan Awadalla and Arul Menezes and Tao Qin and Rui Yan},
journal = {ArXiv},
year = {2023},
volume = {abs/2304.14802}
}
```

```bibtex
@inproceedings{Dehghani2023ScalingVT,
title = {Scaling Vision Transformers to 22 Billion Parameters},
