TY - GEN
T1 - The Queen's Guard
T2 - 39th Annual Computer Security Applications Conference, ACSAC 2023
AU - Shaon, Fahad
AU - Rahaman, Sazzadur
AU - Kantarcioglu, Murat
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/12/4
Y1 - 2023/12/4
N2 - Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on"solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of "add-on"access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.
AB - Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on"solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of "add-on"access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.
KW - Apache Spark Security
KW - Distributed Systems Security
KW - Fine-grained Access Control
KW - Program Analysis
UR - http://www.scopus.com/inward/record.url?scp=85180153545&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85180153545&partnerID=8YFLogxK
U2 - 10.1145/3627106.3627132
DO - 10.1145/3627106.3627132
M3 - Conference contribution
AN - SCOPUS:85180153545
T3 - ACM International Conference Proceeding Series
SP - 241
EP - 255
BT - Proceedings - 39th Annual Computer Security Applications Conference, ACSAC 2023
PB - Association for Computing Machinery
Y2 - 4 December 2023 through 8 December 2023
ER -