On the other hand, are you planning for the situation where an OSD node
dies? Motherboard, power supply, etc.?
Yes, the document at the first link covers this. More precisely, though, we have to choose between two options:
1. Rebalance the cluster.
2. Stop the cluster for maintenance.
The second option is not unusual: most backup systems, for example, can be stopped during production hours without directly affecting business availability.
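
To make the trade-off between the two options concrete, here is a rough back-of-the-envelope sketch in Python. Everything in it is an assumption for illustration: the function names, the idea of comparing the rebalance time against the expected hardware repair time, and the example numbers (36 TB used on the dead node, 1000 MB/s aggregate recovery throughput) are hypothetical and do not come from any real cluster.

```python
def rebalance_hours(node_used_tb: float, recovery_mb_per_s: float) -> float:
    """Estimate the hours needed to re-create a dead node's data elsewhere.

    Both inputs are assumptions supplied by the operator: the data stored
    on the failed node (decimal TB) and the aggregate recovery throughput
    the rest of the cluster can sustain (MB/s).
    """
    if recovery_mb_per_s <= 0:
        raise ValueError("recovery throughput must be positive")
    node_used_mb = node_used_tb * 1_000_000  # decimal TB -> MB
    return node_used_mb / recovery_mb_per_s / 3600  # seconds -> hours


def choose_action(node_used_tb: float, recovery_mb_per_s: float,
                  repair_hours: float, downtime_acceptable: bool) -> str:
    """Pick between the two options with a deliberately simple rule.

    Stopping for maintenance is preferred only if downtime is acceptable
    AND the node can be repaired faster than the data could be rebalanced.
    """
    estimate = rebalance_hours(node_used_tb, recovery_mb_per_s)
    if downtime_acceptable and repair_hours < estimate:
        return "stop for maintenance"
    return "rebalance"


if __name__ == "__main__":
    # Hypothetical backup cluster: downtime is fine, repair takes ~4 hours.
    print(rebalance_hours(36, 1000))
    print(choose_action(36, 1000, repair_hours=4, downtime_acceptable=True))
```

For a backup cluster where downtime is acceptable and the repair is quick, the rule picks maintenance; for anything that must stay available, it always falls back to rebalancing. A real decision would also have to account for the reduced redundancy while the node is down, which this sketch ignores.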