A Reliable Large Distributed Object Store Based Platform for Collecting Event Metadata

Springer Science and Business Media LLC - Tập 19 - Trang 1-19 - 2021

Álvaro Fernández Casaní¹, Juan M. Orduña², Javier Sánchez¹, Santiago González de la Hoz¹

¹Instituto de Física Corpuscular (IFIC), Universidad de Valencia and CSIC, Valencia, Spain

²Departamento de Informática, Universidad de Valencia, Valencia, Spain

Tóm tắt

The Large Hadron Collider (LHC) is about to enter its third run at unprecedented energies. The experiments at the LHC face computational challenges with enormous data volumes that need to be analysed by thousands of physics users. The ATLAS EventIndex project, currently running in production, builds a complete catalogue of particle collisions, or events, for the ATLAS experiment at the LHC. The distributed nature of the experiment data model is exploited by running jobs at over one hundred Grid data centers worldwide. Millions of files with petabytes of data are indexed, extracting a small quantity of metadata per event, that is conveyed with a data collection system in real time to a central Hadoop instance at CERN. After a successful first implementation based on a messaging system, some issues suggested performance bottlenecks for the challenging higher rates in next runs of the experiment. In this work we characterize the weaknesses of the previous messaging system, regarding complexity, scalability, performance and resource consumption. A new approach based on an object-based storage method was designed and implemented, taking into account the lessons learned and leveraging the ATLAS experience with this kind of systems. We present the experiment that we run during three months in the real production scenario worldwide, in order to evaluate the messaging and object store approaches. The results of the experiment show that the new object-based storage method can efficiently support large-scale data collection for big data environments like the next runs of the ATLAS experiment at the LHC.

Tài liệu tham khảo

ATLAS Collaboration: The ATLAS experiment at the CERN Large Hadron Collider. J. Instrum. 3(08), S08003 (2008) Barberis, D., Cárdenas Zárate, SE, Cranshaw, J., Favareto, A., Fernández Casaní, A, Gallas, E.J., Glasman, C., González De La Hoz, S, Hřivnáč, J, Malon, D., Prokoshin, F., Salt Cairols, J., Sánchez, J, Többicke, R, Yuan, R.: The ATLAS EventIndex: Architecture, design choices, deployment and first operation experience. J. Phys.: Conf. Ser. 664(4), 042003 (2015) Barberis, D., Cranshaw, J., Favareto, A., Fernández Casaní, A, Gallas, E., González de la Hoz, S, Hřivnáč, J, Malon, D., Nowak, M., Prokoshin, F., Salt, J., Sánchez Martínez, J, Többicke, R, Yuan, R.: The ATLAS EventIndex: Full chain deployment and first operation. Nuclear and Particle Physics Proceedings 273-275, 913–918 (2016) White, T.: Hadoop: The definitive guide. O’Reilly Media, Inc. (2012) Sánchez, J, Casaní, AF, de la Hoz, S.G.: Distributed data collection for the ATLAS EventIndex. J. Phys.: Conf. Ser. 664(4), 042046 (2015) Salomoni, D., Campos, I., Gaido, L., de Lucas, J.M., Solagna, P., Gomes, J., Matyska, L., Fuhrman, P., Hardt, M., Donvito, G., et al: Indigo-datacloud: A platform to facilitate seamless access to e-infrastructures. J Grid Computing 16(3), 381–408 (2018). https://doi.org/10.1007/s10723-018-9453-3 Krašovec, B, Filipčič, A: Enhancing the grid with cloud computing. J Grid Computing 17(1), 119–135 (2019) Hagras, T., Atef, A., Mahdy, Y.B.: Greening duplication-based dependent-tasks scheduling on heterogeneous large-scale computing platforms. J. Grid Comput. 19(1), 13 (2021) Activemq, http://activemq.apache.org/ Fernandez Casani, A., Sanchez, J., Gonzalez de la Hoz, S., Orduña, JM: Designing Alternative Transport Methods for the Distributed Data Collection of ATLAS EventIndex Project. http://cds.cern.ch/record/2235644/files/ATL-SOFT-SLIDE-2016-869.pdf (2016) Karol, M., Hluchyj, M., Morgan, S.: Input versus output queueing on a space-division packet switch. IEEE Transactions on Communications 35(12), 1347–1356 (1987). https://doi.org/10.1109/tcom.1987.1096719 Mesnier, M., Ganger, G.R., Riedel, E.: Object-based storage. IEEE Commun. Mag. 41(8), 84–90 (2003). https://doi.org/10.1109/MCOM.2003.1222722 Rabbitmq, http://www.rabbitmq.com/ Apache flume, http://flume.apache.org/ Logstash data processing pipeline, https://www.elastic.co/products/logstash Kreps, J., Narkhede, N., Rao, J., et al.: Kafka: A distributed messaging system for log processing. In: Proceedings of NetDB, pp 1–7 (2011) Dobbelaere, P., Esmaili, K.S.: Kafka versus rabbitmq: A comparative study of two industry reference publish/subscribe implementations: Industry paper. In: Proceedings of the 11th ACM International Conference on Distributed and Event-based Systems, DEBS ’17, ACM, New York, NY, USA, pp 227–238. https://doi.org/10.1145/3093742.3093908 (2017) Bird, I.G.: Lhc computing (wlcg): Past, present, and future. Grid and Cloud Computing: Concepts and Practical Applications 192, 1 (2016) Garonne, V., Vigne, R., Stewart, G., Barisits, M., Lassnig, M., Serfon, C., Goossens, L., Nairz, A., Collaboration, A., et al: Rucio–the next generation of large scale distributed system for atlas data management. In: Journal of Physics: Conference Series, IOP Publishing, 513, pp 042021 (2014) Calafiura, P., De, K., Guan, W., Maeno, T., Nilsson, P., Oleynik, D., Panitkin, S., Tsulaia, V., Gemmeren, P.V., Wenaus, T.: The atlas event service: A new approach to event processing. J. Phys.: Conf. Ser. 664(6), 062065 (2015) Amazon simple storage service developer guide, http://docs.aws.amazon.com/AmazonS3/latest/dev/s3-dg.pdf Weil, S.A., Brandt, S.A., Miller, E.L., Long, D.D.E., Maltzahn, C.: Ceph: A scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation, OSDI ’06, USENIX Association, Berkeley, CA, USA, pp 307–320 (2006) Ceph, http://ceph.com/ Amazon s3, cloud computing storage for files, images, videos. http://aws.amazon.com Google protocol buffers: Google’s data interchange format. https://developers.google.com/protocol-buffers/

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Công cụ kiểm tra chính tả và thể thức Viver

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA