You have trade-offs for everything, and "messy" data is one you have to eat if you're mixing potentially bad deployments with good, I don't think there's a way around that. If you're at that scale you're going to be doing some data massaging anyways— from special database instances just for reports, to picking what kind of storage for what kind of data.
The nice thing about dev/qa/prod (or as many as you need, but separate, including the DB) is you don't have to worry about messy data as much. You run the env, do the tests, have clients do some tests, and then afterwards all that data is ignored as you do the prod deploy.
This also lets your clients go nuts in testing stuff, etc., and makes handling tickets far easier.
So much is relative to what you have, and what you want to have, it's hard to say what's best, but FWIW where RDS shines is in cloning massive DBs and whatnot— though there is lots of vendor lock-in at that point, you've probably already bought into AWS so… "in for a penny, in for 10 billion dollars unexpectedly on your next bill", as they say 😜