"High-throughput Genomics: Apache Spark Puts Data Processing at Your Fingertips"
Updated 2024-03-14
The presentation "High-throughput Genomics at Your Fingertips with Apache Spark", given at Spark Summit EU 2016 in Brussels, described how KeyGene, a company specializing in genomics, came to use Apache Spark for analyzing genomics data. The speaker, Erwin Datema, emphasized that although he is neither a computer scientist nor a data scientist, he was able to use Spark effectively for interactive processing and querying of genomics data.
KeyGene adopted Apache Spark to analyze large-scale genomics data sets faster and more efficiently. The talk was told from a user's perspective and showed how domain experts in genomics can use Spark to accelerate their research.
The speaker highlighted the importance of high-throughput genomics in advancing biological research and how Spark has changed the way genomics data is analyzed. By using Spark's distributed computing capabilities, KeyGene was able to significantly reduce the time and resources required to analyze complex genomics data sets.
Overall, the presentation argued that Spark is not only for computer scientists or data scientists: KeyGene's adoption shows that domain experts such as Erwin Datema can use it to perform high-throughput genomics analysis themselves and to gain new insights in their research.