Learning Library

← Back to Library

Edge Computing DNA Sequencing on ISS

Key Points

  • Edge computing relocates compute and storage to where data is generated, slashing latency and the need to transmit large data volumes.
  • The International Space Station (ISS) orbits at about 250 mi in low Earth orbit, using a constellation of geostationary satellites to relay data to ground‑based data centers.
  • Astronauts perform on‑board DNA sequencing for health monitoring, producing raw datasets of several hundred gigabytes that were previously downlinked to Earth for processing.
  • IBM, in partnership with HPE, deployed IBM Cloud‑based, open‑source container workloads on NASA’s SBC2 (Spaceborne Compute) systems via Red Hat OpenShift CRC, handling base‑calling, demultiplexing, alignment, analysis, and report generation directly on the ISS.
  • This edge‑computing approach cuts the end‑to‑end workflow to 6‑8 hours and reduces the final output to under 1 MB, eliminating the need to downlink massive raw data files.

Full Transcript

# Edge Computing DNA Sequencing on ISS **Source:** [https://www.youtube.com/watch?v=5InwlTrl4aY](https://www.youtube.com/watch?v=5InwlTrl4aY) **Duration:** 00:04:29 ## Summary - Edge computing relocates compute and storage to where data is generated, slashing latency and the need to transmit large data volumes. - The International Space Station (ISS) orbits at about 250 mi in low Earth orbit, using a constellation of geostationary satellites to relay data to ground‑based data centers. - Astronauts perform on‑board DNA sequencing for health monitoring, producing raw datasets of several hundred gigabytes that were previously downlinked to Earth for processing. - IBM, in partnership with HPE, deployed IBM Cloud‑based, open‑source container workloads on NASA’s SBC2 (Spaceborne Compute) systems via Red Hat OpenShift CRC, handling base‑calling, demultiplexing, alignment, analysis, and report generation directly on the ISS. - This edge‑computing approach cuts the end‑to‑end workflow to 6‑8 hours and reduces the final output to under 1 MB, eliminating the need to downlink massive raw data files. ## Sections - [00:00:00](https://www.youtube.com/watch?v=5InwlTrl4aY&t=0s) **Edge Computing for ISS DNA Sequencing** - The speaker explains how edge computing places processing and storage on the International Space Station to analyze DNA sequencing data locally, cutting latency and minimizing data transfer to Earth. ## Full Transcript
0:00hello today we will be talking about the 0:02edge computing 0:04in space on the international space 0:05station for the dna sequencing 0:07my name is naim i'm ibm's cto for space 0:10tech 0:11let's talk about edge computing what is 0:13such computing ash computing is about 0:15taking 0:16the compute and storage where the data 0:18is being produced 0:19so we can reduce the latency and reduce 0:23the movement of data 0:24from point a to point b let's first talk 0:26about the international space station 0:28international space station is above 0:31earth in the leo 0:32which is called the lower earth orbit 0:34around 250 miles 0:36above it travels around 17 0:39500 miles per hour and it can circle 0:43every 90 minutes 0:46the way it communicates there are 0:48satellites in the geostationary orbit 0:51it's a constellation of these satellites 0:53from where it relays information 0:54back to the ground stations on the earth 0:57and the data goes into data centers 0:59that's all the mechanism of how the 1:00uplink and downlink works 1:02now let's talk about the dna sequencing 1:05so what is the dna 1:06dna serves as a blueprint for an 1:09organism 1:10dna sequencing is about finding the 1:12characteristics of this organism 1:14it can be very beneficial for monitoring 1:16the astronauts health 1:18or looking for infectious disease on the 1:20space station 1:23the way to date works the astronauts on 1:26the space station 1:27they are scheduled to take a sample from 1:29the surface 1:30and run through this portable dna 1:32sequencer device 1:33it's a physical device which is attached 1:36to a laptop 1:38once the run is done it generates raw 1:41data 1:45which can be several hundred gigabytes 1:48today this data comes down via the 1:51satellites on the ground for processing 1:54and this whole process takes some time 1:56in our collaboration 1:58with hpe who sent their 2:04sbc2 systems space point community 2:06systems in our partnership with them 2:08we determined that we can do the 2:10processing of all of this raw data 2:13which is being produced instead of 2:14bringing that information 2:16down to the earth we can just do all the 2:18processing 2:19we utilized ibm cloud 2:26and we built the open source code using 2:30running it on the red hat 2:34crc which is code ready container 2:36openshift platform 2:38basically it's a set of container 2:40containers code which we built 2:42this code was pushed to nasa and then 2:45nasa pushed this code 2:46up to the space station onto the spc2 2:48computers 2:50now this solution has several steps 2:53because it's a sequential process it has 2:54to go and process all this information 2:56so the first thing the containers when 2:58they come up on the crc platform 3:01they go through a step called base 3:03calling 3:04then the next step is demultiplexing 3:08the third step is the alignment 3:12the fourth step is the analysis 3:16and the last step is actually generating 3:18the report 3:20this whole process can take up to six to 3:22eight hours 3:23and at the end of the day it generates a 3:25file which is less than 3:28one one megabyte so as you can see 3:31is in this edge computing use case where 3:33we were initially 3:34moving hundreds of gigs of data back to 3:37the ground 3:37and processing and taking time in this 3:40case 3:41literally in a day within six to eight 3:43hours we can create the file 3:45the file is less than one meg and we can 3:48send down to the earth 3:49that's the resulting file which the 3:50scientists are looking for 3:52and thus reducing the time latency 3:55preserving the bandwidth which is very 3:56critical at the space station 3:58communications between 4:00because there are many projects 4:01happening so as we said this is an 4:03extreme uh 4:05use case for the edge computing now 4:07think about this as we are moving 4:08forward 4:09going back to the moon as part of rtms 4:11program mars and beyond 4:14if we use this edge computing mechanisms 4:16point devices 4:18as we are heading towards the deeper 4:20space we might be able to detect 4:22possibly 4:23life beyond earth thank you