Multi-core clusters provide a huge amount of computing resources and use a hierarchically structured interconnection network. In this talk, we consider hierarchically structured parallel tasks to improve application performance on multi-core clusters. In particular, we consider scheduling and mapping techniques for parallel tasks that take the architecture of the target systems into consideration. We evaluate the impact of scheduling and mapping for different application programs on different parallel machines