You are here

Data-intensive computing

Systems and problems that include huge data volumes and complex patterns of integration and interaction.

New Staff: Dr Paolo Besana

We welcome Dr Paolo Besana to the Group, who will be working on data-intensive computing machinery.
Paolo is in Room IF5.22.

Research topics: 
Topic of this submission: 

Surfing for earthquakes

A better understanding of the ground beneath our feet will result from research by seismologists and Rapid—a group of computer scientists at the University of Edinburgh. The Earth's structure controls how earthquakes travel and the damage they can cause. A clear picture of this structure would be extremely valuable to earthquake planners, but it requires the analysis of huge amounts of data. The Rapid team developed a system that performs the seismologists' data-crunching, and have made it easy to use by relying on an interface familiar to all scientists – a web browser.

Topic of this submission: 
Projects: 

Screencast:A Rapid portal for computational chemistry on HECToR, the UK-national academic supercomputer

Below a screencast where Rapid was used to develop a portal for the UK-national academic supercomputer HECToR. The portal shows how to setup an advanced compute job involving computational chemistry. You need Flash installed in the browser to watch the video below. Click here for a large version

Topic of this submission: 
Projects: 

New Staff: Dr Michelle Galea

We welcome Michelle, who is going to work on the ADMIRE project. She is in office 5.22.

Topic of this submission: 
Projects: 

TOPP goes Rapid

S. Gesing, van Hemert, J., Koetsier, J., Bertsch, A., and Kohlbacher, O., TOPP goes Rapid, in Cluster Computing and the Grid, IEEE International Symposium on, Los Alamitos, CA, USA, 2010, p. 598--599.

Towards Optimising Distributed Data Streaming Graphs using Parallel Streams

Speaker(s): 
Presentation Type: 
talk

Modern scientific collaborations have opened up the opportunity of solving complex problems that involve multi- disciplinary expertise and large-scale computational experiments. These experiments usually involve large amounts of data that are located in distributed data repositories running various software systems, and managed by different organisations. A common strategy to make the experiments more manageable is executing the processing steps as a workflow.

Date and time: 
Tuesday, 22 June, 2010 - 11:30
Location: 
The Third International Workshop on Data Intensive Distributed Computing, Chicago, Illinois, US
Research topics: 
Projects: 

Hazard forecasting in real time: from controlled laboratory tests to volcanoes and earthquakes

The inherent limits to the predictability of brittle failure events such as earthquakes and volcanic eruptions are important, unknown, and much debated. We will establish techniques to determine what this limit is in the ideal case of controlled laboratory tests, for the first time in real-time, prospective mode, meaning before failure has occurred.

Acronym: 
RapidHazard
Value: 
£526969
Dates: 
Sat, 01/01/2011 to Tue, 12/31/2013
Project members: 
Projects: 

ADMIRE: Facilitating Data Mining and Data Integration

Speaker(s): 
Presentation Type: 
talk

It is evident that data-intensive research is transforming computing landscape. We are facing the challenge of handling the deluge of data generated by sensors and modern instruments that are widely used in all domains. The number of sources of data is increasing, while, at the same time, the diversity, complexity and scale of these data resources are also growing dramatically. To survive the data tsunami, we need to improve our apparatus for the exploration and exploitation of the growing wealth of data.

Date and time: 
Tuesday, 29 June, 2010 - 11:00
Location: 
National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, US.
Projects: 
Research topics: 

ADMIRE: Facilitating Data Mining and Data Integration

Speaker(s): 
Presentation Type: 
talk

It is evident that data-intensive research is transforming computing landscape. We are facing the challenge of handling the deluge of data generated by sensors and modern instruments that are widely used in all domains. The number of sources of data is increasing, while, at the same time, the diversity, complexity and scale of these data resources are also growing dramatically. To survive the data tsunami, we need to improve our apparatus for the exploration and exploitation of the growing wealth of data.

Date and time: 
Thursday, 1 July, 2010 - 13:00
Location: 
Computation Institute, University of Chicago, Chicago, Illinois, US.
Research topics: 
Projects: 

Data-Intensive Research

Speaker(s): 
Presentation Type: 
talk

Edinburgh Data-Intensive Research Data-intensive refers to huge volumes of data, complex patterns of data integration and analysis, and intricate interactions between data and users. Current methods and tools are failing to address data-intensive challenges effectively. They fail for several reasons, all of which are aspects of scalability.

Date and time: 
Tuesday, 1 June, 2010 - 14:00
Location: 
iDEA lab bio-medical data day, Informatics Forum, Edinburgh, UK

Visualisation and analysis of stochastic flow through large network diagrams of biological pathways

CISA Seminar Series
Speaker: 
Tom Freeman

Over the last few years we have been developing a graphical language, the modified Edinburgh Pathway Notation scheme (mEPN), with which to logically depict the biological interactions that together make up pathways (1-3). We have used this language to construct a number of large-scale pathways associated with immune signaling and effector systems and have been exploring ways to model the activity of these networks.

Date and time: 
Monday, 7 June, 2010 - 14:00
Length: 
60 minutes
Location: 
IF431
Research topics: 

Screencast: Rapid Portal for real time geophysical experiments

Below a screencast of the first prototype of the scientific gateway for real time geophysical experiments. This portal was created using Rapid.

Topic of this submission: 
Projects: 

Intuitive Large-scale Image Processing for Biologists

Modern cell and developmental biology and the now-established domain of systems biology use quantitative imaging methods to measure the location, dynamics and interaction of molecules in fixed and living cells, and at increasingly high spatial and temporal resolution. Quantitative imaging depends on the development, delivery, and use of sophisticated image processing and analysis algorithms. The availability of these data analysis tools is commonly cited as a major bottleneck in scientific discovery.

Acronym: 
Rapid-OMERO
Value: 
£780956
Dates: 
Thu, 07/01/2010 to Mon, 12/31/2012
Projects: 

RapidSeis: Enabling User-Defined Seismological Waveform Data Processing over the Grid

Speaker(s): 
Presentation Type: 
talk

The objective of this JISC-funded pilot project was to remove perceived barriers to uptake of an application that performs analysis of seismic waveform data. The aim was to provide the seismological community with a simplified system that overcame important barriers such as installation and understanding of the analysis package, location and transfer of large amounts of input data and visualisation of results.

Date and time: 
Tuesday, 4 May, 2010 - 10:30
Location: 
European Geosciences Union, General Assembly 2010, Vienna, Austria
Projects: 

Toward a service-oriented e-infrastructure for data mining and data-intensive modeling applications in seismology

Speaker(s): 
Presentation Type: 
talk

Global and regional seismology monitoring systems are continuously operated and are transmitting a growing wealth of seismological data in Europe and from around the world. This opens exciting opportunities for a large range of geophysical research. The multi-use nature of these data puts a great premium on open-access data archive infrastructures that are well integrated in the European Plate Observing System (EPOS)—an ESFRI initiative of the solid earth community.

Date and time: 
Wednesday, 5 May, 2010 - 09:30
Location: 
European Geosciences Union

Pages

Subscribe to RSS - Data-intensive computing