September 8, 2008

1. Bioinformatics:  Prof. Lin http://i.cs.hku.hk/~twlam/

  •  Alignment and assembling
  •  partern matching
  •  mining the sequence
  •  index the DB: BWA, SOAP
  • Applications: Re-sequencing

2. Data stream

  • I .  Small Enough Data Structure
    II. Small Memory( independent from input size)
  •  Sliding window model:
     Focus on the most recent Data(Similar to the web log process in Alipay.)
  • Continuous Monitoring of distributed Data Streams.(Multiple Streams)

3. Online Scheduling Dr. Chan:

  • Charactor: 1. time serials 2. Dynamic Size 3. Online( no priori info)
  • FCFS, SJF, Round Robin 
  • Competitive analysis: Flow(A,I)<=C*Flow(Opt,I) , c names competitive ratio.
  • Best Strategy: Working on the least-time-left job.
  • HKU did a great job on Energy efficiency scheduling
    1 power function( typically f(x)=X^3)
            2 temperature
            3 Sensor network

4. Data mining on uncertain data base. Dr. Ben Kao:http://www.cs.hku.hk/~kao

  •  Decision Tree, Etropy
  •  Curve(divide into several parts), Sampling Tech.

5. Security&Integrity of Data Mining Outsoucing.  Prof. Cheung(Head.) http://www.cs.hku.hk/~dcheung

  • Security: DB-->Encryption-->DB'-->Mining-->Result'-->Decryption-->Result
  • Integrity: Audit environment
    DB+DB'-->Merge-->DB*-->Encryption-->DB*'-->Mining-->Result*'-->Decryption-->Result*-->Audit-->Result.
  • Most important idea is AUDIT! By putting some artificial audit items into the Dataset.
    AFI, AII

6. System research Prof. Wang http://www.cs.hku.hk/~clwang

  • Grid computing
  • PvG
  • MIM 

 

BTW:

Today's GRE AW is quite luck... RP supre hao... 

Tags: ,,.