Data-intensive applications are gaining more and more importance in HPC. The emergence of new applications that require enormous amount of data has revealed new problems that need to dealt with when executing such jobs in heterogeneous and geographically distributed environments. In our presentation we will describe some of the challenges in this area and directions which we have taken in order to solve this issues.