대용량 정형 데이터 대상 개인 정보 가명, 익명화를 위한 자동처리 기술
DNA+ ‐ AI 서비스를 위한 데이터 프라이버시 및 네트워크 가상화 기술 연구 사업단
대규모 분산 딥러닝을 위한 인메모리 텐서 데이터베이스
Past Projects
비정형 데이터 비식별 조치 방안
프라이버시 침해 없이 감염병을 예방하는 데이터 기반 시스템
개인정보를 안전하고 편리하게 빅데이터 처리할 수 있는 방법
CCTV 빅데이터 기반 지능형 범죄추적/예방 및 사회안전시스템
빅데이터 환경에서 비식별화 기법을 이용한 개인 정보보호 기술 개발
Tajo: A Distributed Data Warehouse System on Large Cluster
Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. Tajo uses HDFS as a primary storage layer and has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities.
Web Log Analyzer
Web Logs Analyzer is a system that user can easily query about real network traffic. It provides various features such as ETL, analytic measure materialization, query registration etc.
Real-time data processing network system
Pub/sub based user preference information system,
Real-time event processing network system,
Scalable system that using cloud computing techniqueGraphMR: A Distributed Graph Match Method using MapReduce
This work is a distributed graph match method on large data sets. This work was submitted to IEEE Transactions on Kowledge and Data Engineering (TKDE).
SPIDER: A System for Scalable, Parallel / Distributed Evaluation of large-scale RDF
This project aims at processing large-scale RDF data. In this project, we developed scale RDF processing method using MapReduce that is a distributed processing framework and storing techniques for large RDF data sets. This project was demonstrated in the 18th ACM Conference on Information and Knowledge Management (CIKM) in November 2009.
R3 - Wireless Broadcast System
This is a system that consist of server and client. The server broadcast news data and the client can connect to the channel and get the data.