I've seen the committers thread on trying to improve dataclasses' start time performance but it seemed focused on the execution time, while that is an area to look at I think it may be worth looking at improving the import time of dataclasses itself.
I've been working on my own implementation of the same idea and had
found that importing some stdlib modules had a significant impact on the overall time it took to run. On looking at
dataclasses
I noticed that it also imported some of the same modules.