1. Bioinformatics: Prof. Lin http://i.cs.hku.hk/~twlam/
- Alignment and assembling
- partern matching
- mining the sequence
- index the DB: BWA, SOAP
- Applications: Re-sequencing
2. Data stream
- I . Small Enough Data Structure
II. Small Memory( independent from input size) - Sliding window model:
Focus on the most recent Data(Similar to the web log process in Alipay.) - Continuous Monitoring of distributed Data Streams.(Multiple Streams)
3. Online Scheduling Dr. Chan:
- Charactor: 1. time serials 2. Dynamic Size 3. Online( no priori info)
- FCFS, SJF, Round Robin
- Competitive analysis: Flow(A,I)<=C*Flow(Opt,I) , c names competitive ratio.
- Best Strategy: Working on the least-time-left job.
- HKU did a great job on Energy efficiency scheduling
1 power function( typically f(x)=X^3)
2 temperature
3 Sensor network
4. Data mining on uncertain data base. Dr. Ben Kao:http://www.cs.hku.hk/~kao
- Decision Tree, Etropy
- Curve(divide into several parts), Sampling Tech.
5. Security&Integrity of Data Mining Outsoucing. Prof. Cheung(Head.) http://www.cs.hku.hk/~dcheung
- Security: DB-->Encryption-->DB'-->Mining-->Result'-->Decryption-->Result
- Integrity: Audit environment
DB+DB'-->Merge-->DB*-->Encryption-->DB*'-->Mining-->Result*'-->Decryption-->Result*-->Audit-->Result. - Most important idea is AUDIT! By putting some artificial audit items into the Dataset.
AFI, AII
6. System research Prof. Wang http://www.cs.hku.hk/~clwang
- Grid computing
- PvG
- MIM
BTW:
Today's GRE AW is quite luck... RP supre hao...
