Strengthening National Digital Infrastructure Privacy Focused Data Pipelines for Ethical Behavioral Analytics
DOI:
https://doi.org/10.15680/IJCTECE.2023.0604012Keywords:
Behavioural Analytics, Privacy, Pipelines, Digital Infrastructure, EthicsAbstract
In this paper, the researcher will analyze the performance of a privacy committed data pipeline against a standalone centralized one on large data behavioral analytics. The findings indicate that privacy controls introduce small overheads but performance of the system is not lost in a serious manner. The predictive models are also precise, and there is less than a 5 percent loss even in case of the different privacy, synthetic data, or federated analytics. High privacy improvements were realized, such as significant re-identification risk reduction and violation of policy. Telecom and commerce cross-sector testing process assures that the design is generic and is applicable to any dataset. The results indicate that high analytical performance and strong privacy may co-exist.
References
1. Stadler, T., Oprisanu, B., & Troncoso, C. (2020). Synthetic Data -- Anonymisation Groundhog Day. arXiv (CornellUniversity). https://doi.org/10.48550/arxiv.2011.0701 8
2. Ali, M., Naeem, F., Tariq, M., & Kaddoum, G. (2022). Federated Learning for Privacy Preservation in Smart Healthcare Systems: A Comprehensive survey. IEEE Journal of Biomedical and Health Informatics, 27(2), 778–789. https://doi.org/10.1109/jbhi.2022.3181823
3. Choquette-Choo, C. A., Dullerud, N., Dziedzic, A., Zhang, Y., Jha, S., Papernot, N., & Wang, X. (2021). CAPC Learning: Confidential and private collaborative learning. arXiv (Cornell
University). https://doi.org/10.48550/arxiv.2102.0518 8
4. Jayaraman, B., & Evans, D. (2019). Evaluating differentially private machine learning in practice. arXiv (Cornell University), 1895– 1912. https://doi.org/10.48550/arxiv.1902.08874
5. Beese, J., Aier, S., Haki, K., & Winter, R. (2022). The impact of enterprise architecture management on
1090. https://doi.org/10.1080/0960085x.2022.2103045
6. Quach, S., Thaichon, P., Martin, K. D., Weaven, S., & Palmatier, R. W. (2022). Digital technologies: tensions in privacy and data. Journal of the Academy of Marketing Science, 50(6), 1299–1323. https://doi.org/10.1007/s11747-022-00845-y
7. Jones, B., Jones, R., & Luger, E. (2022). AI ‘Everywhere and Nowhere’: Addressing the AI intelligibility problem in public service journalism. Digital Journalism, 10(10), 1731–
1755. https://doi.org/10.1080/21670811.2022.2145328
8. Truong, N., Sun, K., Wang, S., Guitton, F., & Guo, Y. (2020). Privacy Preservation in Federated Learning: An insightful survey from the GDPR Perspective. arXiv (Cornell
University). https://doi.org/10.48550/arxiv.2011.0541 1
9. Milkaite, I., De Wolf, R., Lievens, E., De Leyn, T., & Martens, M. (2021). Children’s reflections on privacy and the protection of their personal data: A child- centric approach to data protection information formats. Children and Youth Services Review, 129, 106170. https://doi.org/10.1016/j.childyouth.2021.106
170
10. Torkzadehmahani, R., Kairouz, P., & Paten, B. (2020). DP-CGAN: Differentially private Synthetic data and label generation. ArXiv (Cornell University). https://doi.org/10.48550/arxiv.2001.0970 0

