| 6-1 | Optimization of Performance in Intersective Photothermoelectrical Converters of Solar Energy until 500k Temperatures | 396 |
|-----|-----|-----|
| 6-2 | Ground state energy of wannier exciton in spherical quantum dots | 400 |
| 6-3 | Ion implantation of Rare-Earth elements in a structure of optical materials | 403 |
| 6-4 | The photoreceiver with different spectral sensitivity nanocrystall structure in a basic CdTe-SiO2-Si | 406 |
| 6-5 | Development of the stand for researches spectral characteristics of the optical fibre | 410 |
| 6-6 | Dependency of Program Efficiency on Implementation Conditions for NOR Type Silicon-on-Insulator (SOI) Flash Memory | 412 |

# TH7: Communication Network II

| 7-1 | Precise Network Synchronization Technique Using Phase Adjustment and External Filtering | 416 |
|-----|-----|-----|
| 7-2 | Mutual Authentication Scheme between Mobile Routers using Authentication Token in MANEMO | 420 |
| 7-3 | Place Multi-Level Near-Secure Communication Mechanism for Mobile Devices | 424 |
| 7-4 | Performance Analysis of Ant-based Routing Algorithms for MANETs | 428 |
| 7-5 | Analysis and estimation of QoS providing methods for applying in NGN | 432 |
| 7-6 | QoS providing architecture model | 436 |
| 7-7 | Timing Estimation Method for Parameterized Macro Model Considering Pin-to-Pin Delay in High-Level Synthesis | 439 |

# TH8: VLSI Design and Applications

| 8-1 | New Motion Vector Prediction for Pipelined Motion Estimation in H.264/AVC | 443 |
|-----|-----|-----|
| 8-2 | Design of Low Complexity LDPC Encoder for IEEE 802.11n WLAN Systems | 447 |
| 8-3 | VLSI Implementation of Counter Mode AES for G-PON | 451 |
| 8-4 | The Algorithm and Hardware Implementation of Average Luminance Computation Method of Image to Improve Auto Exposure in Mobile Camera | 456 |
| 8-5 | A Hard Decision LDPC Decoder Implementation for Error Correction of NAND Flash Memory | 460 |
| 8-6 | A highly linear LNA employing transconductance non-linearity cancellation with the desensitization technique of harmonic feedback effect | 464 |

# TH9: Modeling & Simulations

| 9-1 | Modeling TUIT's Secondary Corporate Network | 468 |
|-----|-----|-----|
| 9-2 | Simulation of ATM networks using OPNET | 471 |
| 9-3 | About one clustering algorithm in intellectual data analysis | 476 |
| 9-4 | The Study of Distribution of Ions Channeling in Single Crystals by Computer Modeling | 479 |
| 9-5 | Methods of algorithmization in intellectual systems | 483 |
| 9-6 | Using Technology with Multiple Models an Differentiation | 485 |
| 9-7 | About one way of increase of the system effectiveness of detection of intrusions | 487 |

# TH10: IT-business & Computer Systems

| 10-1 | Interactive methods of teaching a foreign language | 491 |
|------|-----|-----|
| 10-2 | Formation of risk-management in sphere of information and communication technologies of the Republic of Uzbekistan | 493 |
| 10-3 | Online Payment Service System for Mobile Companies in Uzbekistan | 495 |
| 10-4 | Private Box Algorithm in vital information exchange systems | 497 |
| 10-5 | Performance Evaluation of GCC 4.x on EISC Architecture | 499 |
| 10-6 | Recoplay: An Effective Record-Replay Approach to Debugging Complex Embedded Software | 503 |
| 10-7 | New information technology and social institutions | 507 |

# TH11: Bio Technologies

| 11-1 | Equal Error Rate Minimization for Biometrics Fusion | 513 |
|------|-----|-----|
| 11-2 | Forecasting of developing processes on the basis of gravitational forces | 517 |
| 11-3 | New Injection-Voltaic Effect Elementary Basis | 520 |
| 11-4 | Development of high-speed multi-valued elements and structures of modelling neurons | 524 |
| 11-5 | The non functional equations buildings methodic | 526 |
| 11-6 | High-operated multi-valued memorizing device | 529 |
| 11-7 | Improvement of the diagnostics and treatments neurological sick on base information technology | 531 |

# TH12: Video/Audio/Speech Processing

| 12-1 | Rate Control Initialization Scheme for H.264/AVC | 534 |
|------|-----|-----|
| 12-2 | Real-time Global Stereo Matching Implementation On FPGA | 538 |
| 12-3 | Fast Multiple Reference Frame Motion Estimation Method for H.264/AVC | 541 |
| 12-4 | Objectively Quantified Consonance of Complex Sounds | 545 |
| 12-5 | A Bit Reduction Algorithm for Spectral Band Replication Using Threshold in Quiet | 549 |
| 12-6 | Adaptive Length Block Truncation Coding for LCD Overdrive | 554 |

# PS1: Poster Session I

| 1-1 | Implementation of Ternary Adder and Multiplier Using Current-Mode CMOS | 559 |
|------|-----|-----|
| 1-2 | Relational Analysis between EEG and Respiration | 564 |
| 1-3 | A New Comparator Offset Cancellation Method | 567 |
| 1-4 | A Study on the Design of the On-Chip 2.5V-to-1.0V VDC for Semiconductor Devices | 571 |
| 1-5 | A 6-bit 1.25-GS/s 1.94pJ/step Flash ADC in 0.13-µm CMOS | 575 |
| 1-6 | The Design of DC-DC Converter with DTMOS Switch | 579 |
| 1-7 | A Low-Voltage-Triggering-Dual-directional SCR Device for ESD Protection | 583 |
| 1-8 | 8.3~10.6GHz-Band New LC Oscillator with Low Phase-Noise and Wide Tuning Range for SONET Communication Applications | 588 |
| 1-9 | Design of A Microstrip Linear Tapered Slot Antenna | 592 |
| 1-10 | Stable emitter followers based on injection-voltaic transistors for power amplifier | 597 |
| 1-11 | Analysis of Epileptic Seizures in EEG using Wavelet Transforms | 600 |
| 1-12 | Performance improvement of LCMV beamformer using matrix property mapping | 604 |
| 1-13 | EMG signal compression with an ACELP coder using trellis-coded quantization of LSF parameters | 608 |
| 1-14 | Detection of Underwater Transient Signals Using EVRC Noise Suppression Module | 612 |
| 1-15 | Performance Analysis of Adjacent Channel Interference based on Spectrum Masks in OFDM Wireless Communications | 617 |
| 1-16 | Search and Selection for a New Spreading Code set and its Application to IR-UWB System | 621 |
| 1-17 | Inter-cell Cyclic Delay and Transmit Diversity for MIMO-OFDM Broadcasting in Cellular Environments | 625 |
| 1-18 | Redesign of Probability Distribution to Select Contention Slot in MAC protocol for Real-Time Wireless Sensor Network | 629 |
| 1-19 | Performance of Noise-Predictive Turbo Equalization for PMR Channel | 633 |
| 1-20 | Hybrid A Priori Error and A Posteriori Error-based Gradient Adaptive Step-size LMS Algorithms | 637 |
| 1-21 | Mobile Book Search System Using WiBro and Voice Information Technology | 641 |
| 1-22 | PAPR Reduction of an OFDM Signal Using Selected Mapping Based on Erasure Decoding | 645 |
| 1-23 | Design of a Variable Reed-Solomon Decoder Based on Shortening and Puncturing | 649 |
| 1-24 | Real-time Response of Optically-Controlled Microwave Pulses through Open-Ended Microstrip lines by the Way of Differential Analysis | 653 |

# Precise Network Synchronization Technique Using Phase Adjustment and External Filtering

\*Chang-Kyung Seong, \*\*Seung-Woo Lee, and \*Woo-Young Choi

\*Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea

\*\*NIP Technology team, Electronic Telecommunication Research Institute, Daejeon, Korea

wchoi@yonsei.ac.kr

Abstract – Network synchronization is extremely important for many applications. To improve on the Pulse Addition and Swallowing (PAS) method, which adjusts only the TOD (Time-Of-Day), we investigate clock rate synchronization. The clock phase is adjusted directly, which extends the frequency resolution and gives a deterministic control gain for clock rate control. In addition, a digital LPF mitigates the clock rate fluctuation caused by the granularity of TOD. The proposed structure is modeled and simulated with MATLAB. Simulation results for ten cascaded nodes show that the LPF reduces the peak-to-peak time offset from 3 UI to about 0.21 UI.

## 1. Introduction

Time and frequency synchronization between distributed network nodes has many important benefits by making network elements share the same time information and clock frequency. Applications such as real-time AV streaming, automatic control or manufacturing, wireless communication, and positioning systems especially require precise synchronization due to hard timing constraints.

Several approaches have been considered for network synchronization. Most of them are based on two-way message exchange using the time stamps specified in IEEE 1588 [1]. To adjust the time rate, the PAS method has been the most widely used in synchronization systems [2]. In addition, DCOs (Digitally-Controlled Oscillators) and DCXOs (Digitally-Controlled Crystal Oscillators) have been used to generate a variable clock frequency in ADPLL (All-Digital Phase Locked Loop) applications [3][4]. The PAS method, however, does not adjust the frequency of the clock signal but only the increment rate of TOD. Although a DCO or DCXO allows the clock frequency to be adjusted, the resulting frequency resolution is not sufficient, and complex hardware is required to overcome this. A further problem is that their characteristics are not deterministic because of PVT variations.

Granularity of TOD also degrades the performance of a synchronization system. Because TOD is inherently quantized, the time offset between two nodes fluctuates even when they are synchronized, owing to the uncertainty of the time information.

We propose a new structure to reduce the fluctuation of time offset caused by finite frequency resolution and granularity of TOD. Sections 2 and 3 describe the time synchronization model and the problems that cause peak-to-peak fluctuation of time offset, respectively. Section 4 proposes two ideas to overcome these problems, and Section 5 concludes the paper.



Figure 1: One set of message exchange between master and slave nodes to measure time offset and delay

## 2. Time Synchronization Model

Figure 1 shows the two-way message exchange procedure between master and slave nodes described in IEEE 1588 PTP (Precision Time Protocol) [5]. The diagram has two independent time axes, one for the master node and one for the slave node. Each axis corresponds to the TOD value derived from that node's own free-running oscillator. To synchronize its clock frequency and TOD value to those of the master node, the slave node measures its time offset from the master and the network delay.

One set of the procedure consists of four messages. At the start of a set, the master node sends a *Synch Message* to the slave node. When the message arrives at the slave after the propagation delay (D), the slave node records the arrival time $T_2$ according to its own TOD value. Shortly afterwards, a *Follow-up Message* containing the departure time of the *Synch Message* ($T_1$) is sent to the slave node. To obtain the delay (D) and separate it from the time offset (O), the slave node then sends a *Delay Request Message* to the master node at $T_3$. When the master node detects this message, it records the arrival time $T_4$ according to its own TOD value. Once the master node returns $T_4$ in a *Delay Response Message*, the slave node can calculate O and D from equations (1) and (2), respectively. Note, however, that these equations hold only when the delays are symmetric in both directions.

$$O = \frac{(T_2 - T_1) - (T_4 - T_3)}{2} \tag{1}$$

$$D = \frac{(T_2 - T_1) + (T_4 - T_3)}{2} \tag{2}$$
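Equations (1) and (2) can be checked numerically with a short sketch. The function name and the time-stamp values below are hypothetical and chosen only for illustration; the sketch assumes that O is the slave time minus the master time and that the delay is symmetric, as stated above.

```python
def offset_and_delay(t1, t2, t3, t4):
    """Return (O, D) from the four time stamps, per equations (1) and (2)."""
    forward = t2 - t1    # master-to-slave interval, equals D + O
    backward = t4 - t3   # slave-to-master interval, equals D - O
    return (forward - backward) / 2, (forward + backward) / 2


# Hypothetical example: the slave runs 5 UI ahead and the one-way delay is 20 UI.
t1 = 1000             # Synch Message departure (master TOD)
t2 = t1 + 20 + 5      # Synch Message arrival (slave TOD)
t3 = 1100             # Delay Request departure (slave TOD)
t4 = t3 - 5 + 20      # Delay Request arrival (master TOD)

offset, delay = offset_and_delay(t1, t2, t3, t4)
print(offset, delay)  # 5.0 20.0
```

If the two directions had different delays, half of the difference would appear in the computed offset, which is why the symmetry condition matters.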



**Figure 2:** Simplified synchronization model with clock rate adjustment: (a) times of each node vs. time of master node (b) time offset between slave and master nodes vs. time of master node.

Figures 2 (a) and (b) illustrate simplified synchronization models. Only two nodes are considered for simplicity. In figure 2 (a), the horizontal and vertical axes represent the time of the master node and the time of each node, respectively, so the slope of a line is the normalized clock rate of that node. The dashed line, with unity slope, is the time of the master node. The segmented solid lines are the time of the slave node, which adjusts its clock rate after every message exchange. Without perfect synchronization, the n-th segment of the solid line has a slightly different slope from that of the master node because of the frequency offset, $\Delta_n$. Figure 2 (b) is a modified version of figure 2 (a) in which the vertical axis represents the time offset between the slave and master nodes. Denoting the time offset at the n-th message exchange as $O_n$, a simple equation follows:

$$\frac{O_{n+1}-O_n}{T_n} = \Delta_n. \tag{3}$$

The frequency offset, $\Delta_n$, that the synchronizer should have after the n-th message exchange is derived from equation (3) and its recursive version as follows:

$$\Delta_n = \Delta_{n-1} + \frac{O_{n+1} - O_n}{T_n} - \frac{O_n - O_{n-1}}{T_{n-1}}. \tag{4}$$

Finally, equation (5) is obtained by forcing the next time offset, $O_{n+1}$, to zero:

$$\Delta_n = \Delta_{n-1} - \frac{O_n}{T_n} - \frac{O_n - O_{n-1}}{T_{n-1}}. \tag{5}$$
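The recursion in equation (5) can be sketched as a small idealized simulation. This is an illustration under stated assumptions rather than the model used in the paper: time is treated as continuous (no TOD quantization), the commanded offset $\Delta_n$ is assumed to add to an unknown free-running offset, and the function names and numerical values are hypothetical.

```python
def next_frequency_offset(delta_prev, o_n, o_prev, t_n, t_prev):
    """Equation (5): the frequency offset that drives O_{n+1} to zero."""
    return delta_prev - o_n / t_n - (o_n - o_prev) / t_prev


def simulate(o_start, free_running, interval, exchanges):
    """Time offsets seen at successive message exchanges (ideal, unquantized)."""
    control = 0.0                       # commanded offset before any correction
    o_prev = o_start
    o_now = o_prev + (free_running + control) * interval   # equation (3)
    offsets = [o_prev, o_now]
    for _ in range(exchanges):
        control = next_frequency_offset(control, o_now, o_prev, interval, interval)
        # The actual slope is the free-running offset plus the commanded offset.
        o_prev, o_now = o_now, o_now + (free_running + control) * interval
        offsets.append(o_now)
    return offsets


# Hypothetical values: 10 UI initial offset, 50 ppm free-running offset, T = 5e7 UI.
offsets = simulate(o_start=10.0, free_running=50e-6, interval=5e7, exchanges=3)
print([round(o, 6) for o in offsets])
```

In this ideal setting the offset grows during the first uncorrected interval and returns to approximately zero after the first correction, because the measured slope term in equation (5) cancels the free-running offset.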

## 3. Problems in Synchronization Systems

#### A. Finite frequency resolution

In digitally-controlled clock synchronization systems, the amount of controllable frequency step is limited. Quantization errors due to the finite frequency step continuously accumulate time errors during a message exchange interval. The amount of error accumulation is proportional to the message exchange interval. As network systems demand longer message exchange intervals, more accurate and precise frequency control is necessary.

The minimum frequency adjustment step should be able to allow fine tuning, so that the time error accumulation during an exchange interval is under 0.5 UI [2], or

$$T \cdot \Delta f < 0.5, \tag{6}$$

where T is the normalized message exchange interval and $\Delta f$ is the frequency resolution. In IEEE 1588, the nominal values of clock rate and message exchange interval are 25 MHz and 2 seconds, respectively. As a result, T is $5 \times 10^7$ UI and $\Delta f$ must be less than 0.01 ppm. IEEE 1588 allows a free-running frequency offset of up to $\pm 100$ ppm. Therefore, all nodes should be able to change their frequency rate by up to $\pm 200$ ppm in the worst case. To adjust the frequency from 0.01 ppm to 200 ppm, slave nodes must control the frequency rate with a digital code of at least 15 bits [2].
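The arithmetic behind these figures can be reproduced with a brief sketch. The function name is hypothetical, and the bit count is taken here as the number of bits needed to count the steps between 0.01 ppm and 200 ppm in magnitude, which is an assumption about how the 15-bit figure is reached.

```python
import math


def required_resolution(clock_hz, interval_s, range_ppm, max_error_ui=0.5):
    """Return (T in UI, frequency resolution in ppm, control-word bits)."""
    t_ui = clock_hz * interval_s                # normalized exchange interval T
    delta_f_ppm = max_error_ui / t_ui * 1e6     # bound from inequality (6)
    steps = range_ppm / delta_f_ppm             # number of steps up to range_ppm
    return t_ui, delta_f_ppm, math.ceil(math.log2(steps))


t_ui, delta_f_ppm, bits = required_resolution(25e6, 2.0, 200.0)
print(t_ui, round(delta_f_ppm, 6), bits)  # 50000000.0 0.01 15
```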

Several approaches have been used for rate adjustment. Among these, the PAS method is the most straightforward and has been used in most IEEE 1588 implementations [2]. This method adjusts the rate of TOD by increasing or decreasing the TOD increment, depending on whether the clock rate should become faster or slower. If a faster clock rate is needed, a larger TOD increment is occasionally selected, and vice versa. Despite its ease of implementation, the accuracy of synchronization is limited by the resolution of the TOD increment. Moreover, PAS does not adjust the clock rate itself.

As tools to digitally control the oscillation frequency, DCOs and DCXOs have been widely used in ADPLLs. Since it is hard to give a DCO or DCXO a resolution of more than 15 bits, $\Delta\Sigma$ modulators are used to increase the effective frequency resolution [3][4]. However, DCOs integrated on a silicon die suffer from fluctuation of the free-running frequency, poor phase noise, and PVT variation, which yield a non-deterministic DCO gain and oscillation range. Although DCXOs have a more stable oscillation frequency and better phase noise than DCOs, they are expensive to implement and also have non-deterministic gain.

#### B. Granularity of TOD

Time information, or TOD, is generated by a digital counter triggered by the node's own system clock signal. Without PAS, the minimum step of the TOD value is fixed at 1 UI of the system clock signal. Since the real (analog) departure and arrival times ($T_1$, $T_2$, $T_3$ and $T_4$) are quantized to integer multiples of UI, each TOD value $[T_k]$ can be decomposed into two parts, the real time $T_k$ and the quantization error $e_k$,

$$[T_k] = T_k + e_k, \quad (k = 1, 2, 3, 4) \tag{7}$$

where $[X]$ rounds $X$ up to the nearest integer (toward $+\infty$), and $0 \le e_k < 1$ in UI. The time offset calculated from the message exchange ($O'$) is therefore also a quantized version of the real time offset (O), as shown in equation (8):

$$O = \frac{([T_2] - [T_1]) - ([T_4] - [T_3])}{2} - \frac{(e_2 - e_1) - (e_4 - e_3)}{2} = O' - E, \tag{8}$$

where $O' = \frac{([T_2] - [T_1]) - ([T_4] - [T_3])}{2}$ and $E = \frac{(e_2 - e_1) - (e_4 - e_3)}{2}$.



Figure 3: Effect of finite time resolution



**Figure 4:** Behavioral simulation of time synchronization with TOD

Figure 3 plots the effect of equation (8). The calculated time offset ($O'$) takes one of three or four values even for a single real time offset value (O), depending on the four variables $e_1$, $e_2$, $e_3$ and $e_4$. This uncertainty due to the granularity of TOD degrades synchronization performance. Since a calculated time offset ($O'$) within $\pm 0.5$ UI is not detectable, this range can be regarded as a dead zone for detection. Therefore, the resulting real time offset (O) can fluctuate by up to $\pm 1.5$ UI.
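The spread of $O'$ for one fixed real offset can be illustrated by enumerating time stamps with different fractional parts. The sketch below is an assumption-laden illustration, not the simulation behind figure 3: the real offset, the delay, the base time stamps and the 1/16 UI sweep grid are hypothetical, and exact rational arithmetic is used so that the rounding in equation (7) is not disturbed by floating-point error.

```python
from fractions import Fraction
from math import ceil


def measured_offset(t1, t3, offset, delay):
    """O' of equation (8): each time stamp is rounded up to a whole UI first."""
    t2 = t1 + delay + offset   # Synch Message arrival on the slave's time axis
    t4 = t3 - offset + delay   # Delay Request arrival on the master's time axis
    return Fraction((ceil(t2) - ceil(t1)) - (ceil(t4) - ceil(t3)), 2)


real_offset = Fraction(1, 4)   # hypothetical real offset O, in UI
delay = Fraction(203, 10)      # hypothetical symmetric delay D, in UI

# Sweep the fractional parts of the two departure times in steps of 1/16 UI.
values = {
    measured_offset(1000 + Fraction(i, 16), 2000 + Fraction(j, 16), real_offset, delay)
    for i in range(16)
    for j in range(16)
}
print(sorted(values))  # three distinct values: -1/2, 0 and 1/2
```

For this choice of parameters a single real offset of 0.25 UI is reported as any of three half-integer values, each within 1 UI of the real offset, in line with the bound $|E| < 1$ implied by $0 \le e_k < 1$.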

To show this effect, a behavioral model of the synchronization system was implemented in MATLAB, and simulations were performed with granulated TOD instead of real analog time. The following assumptions are made in the simulations:

- The slave node has a different initial TOD value and free-running frequency from the master node;
- Message exchanges are executed every $2 \times 10^8$ clock cycles (2 sec with a 100-MHz clock);
- Up- and down-link network delays are symmetric;
- The granularity of TOD is 1 UI.

As expected, figure 4 shows that the time offset fluctuates around zero with a variation of $\pm 1.5$ UI, that is, 3 UI peak-to-peak.

## 4. A New Structure

#### A. Phase adjustment

**Figure 5:** Generation of effective frequency offset using phase adjustment

**Figure 6:** Block diagram of clock synchronization system using phase adjustment

The mechanism of the new approach, which controls the phase directly, is depicted in figure 5. The straight solid line represents the slave's time offset from the master under the free-running frequency offset. The segmented solid lines are the result of phase shifts that adjust the frequency of the slave clock. Suppose that the clock signal can be shifted by 1/N UI in phase, that is, the minimum phase shift step is 1/N UI. In the figure, a unit phase step is applied every $K_n$ clocks, $M_n$ times in total, to generate a frequency deviation equal to the target frequency offset $\Delta_n$ from the free-running line. The time offset accumulated during T equals the total phase shift, as shown in equation (9),

$$\left|\Delta_n\right| T = \frac{M_n}{N}. \tag{9}$$

The period of the phase shift, $K_n$, is then derived as follows:

$$K_n = \frac{T}{M_n} = \frac{1}{|\Delta_n| \cdot N}. \tag{10}$$
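Equations (9) and (10) translate directly into a few lines of code. The sketch below uses hypothetical values (a 50 ppm target offset and N = 16 phases, with the $2 \times 10^8$-cycle interval of the earlier simulation) and a hypothetical function name; it only evaluates the two formulas.

```python
def phase_shift_schedule(delta_n, n_phases, interval_ui):
    """Return (M_n, K_n) from equations (9) and (10)."""
    if delta_n == 0:
        return 0.0, float("inf")       # no phase shift is needed
    m_n = abs(delta_n) * interval_ui * n_phases   # number of unit phase steps
    k_n = interval_ui / m_n                       # clocks between two steps
    return m_n, k_n


# Hypothetical example: 50 ppm target offset, 16 clock phases, T = 2e8 UI.
m_n, k_n = phase_shift_schedule(50e-6, 16, 2e8)
print(round(m_n), round(k_n))  # 160000 1250

# Smallest non-zero effective offset: a single step (M_n = 1) in one interval.
min_offset = 1 / (16 * 2e8)
print(min_offset)  # 3.125e-10
```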

By handling the phase of the clock signal instead of its frequency, the effective frequency resolution becomes very fine: the minimum effective frequency offset, $1/(NT)$, is generated by a single phase-shift step during one message exchange interval. Moreover, the control gain is deterministic and immune to PVT variation, unlike that of the oscillators mentioned above. Although non-uniform phase spacing among the N clocks can cause unwanted phase error, the total phase error sums to zero. The drawback of this approach is that the phase of the slave clock periodically jumps by 1/N UI, so the peak-to-peak jitter of the clock is 1/N UI. However, this effect becomes negligible when N is chosen large.

Figure 6 shows part of the block diagram of the proposed clock synchronization system. An external crystal oscillator (XO) provides a reference clock with a free-running frequency offset. The multi-phase clock generator produces N clock phases with the same frequency as the reference clock. One of them is dynamically selected as the slave clock by an N:1 MUX according to the phase control code from the phase controller (PC).



**Figure 7**: Complete block diagram of proposed clock synchronization system

The TOD value is counted up by the TOD generator, which is triggered by the slave clock. A block named the slave message exchanger handshakes with the master node and delivers the results $T_1$, $T_2$, $T_3$ and $T_4$ to the PC. Finally, the PC calculates $\Delta_n$ and $K_n$ and counts the phase control code up or down every $K_n$ clock cycles.
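The counting behavior of the PC can be sketched as follows. This is a simplified illustration with several assumptions that are not stated in the text: $K_n$ is rounded to a whole number of clocks, a positive $\Delta_n$ is mapped to counting up (the actual direction depends on the ordering of the N phases), and the code wraps modulo N. The function name and example values are hypothetical.

```python
def phase_codes(delta_n, n_phases, cycles):
    """Phase control code (MUX select value) after each slave clock cycle."""
    if delta_n == 0:
        return [0] * cycles                       # keep the same phase
    k_n = round(1 / (abs(delta_n) * n_phases))    # equation (10), rounded
    step = 1 if delta_n > 0 else -1               # assumed sign convention
    code, codes = 0, []
    for cycle in range(1, cycles + 1):
        if cycle % k_n == 0:
            code = (code + step) % n_phases       # select the adjacent phase
        codes.append(code)
    return codes


# Deliberately large offset so that the pattern is visible in a few cycles.
print(phase_codes(0.125, 4, 8))   # [0, 1, 1, 2, 2, 3, 3, 0]
print(phase_codes(-0.125, 4, 8))  # [0, 3, 3, 2, 2, 1, 1, 0]
```

A non-integer $K_n$ would call for an accumulator that carries the fractional remainder from one step to the next; that refinement is omitted here.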

#### B. External filtering

Intuitively, low-pass filtering can be considered as a way to reduce dithering. Inserting an LPF into the time estimator loop reduces high-frequency dithering. However, an LPF inside the loop does not change the detection resolution of TOD-based message exchange, so the total peak-to-peak dithering remains $\pm 1$ UI. We therefore use an LPF outside the loop instead of inside it.

A complete block diagram of the proposed clock synchronization system is given in figure 7. The system consists of the XO, the multi-phase clock generator, and two parts enclosed in dashed and dotted boxes, respectively. The part within the dashed box, mentioned earlier in figure 4, is named the time estimator (TE) because its TOD value is not exactly the same as the master's but dithers around it. For this reason, the word 'unfiltered' is prefixed to the names of all nodes in the TE block.

The part within the dotted box, on the other hand, is newly added and named the time filter (TF). It has a structure similar to that of the TE except for the addition of a digital LPF. The digital LPF in the TF block receives $\Delta_n$ from the PC in the TE and filters out its high-frequency components to mitigate the fluctuation of time offset. The filtered $\Delta_n$ is in turn fed into a PC in the TF block and used to calculate the phase shift period ($K_n$). Finally, a filtered slave clock with the filtered frequency offset is generated and produces a new TOD value. This TOD value is used for another message exchange with the next slave node.

It is worth noting that the deterministic characteristics of the phase control method are what make external filtering possible. If the two devices that generate the frequency offset in the TE and the TF were mismatched, the same control signal would affect each of them differently, and the TF would generate a clock signal with a different frequency.

**Figure 8**: Time offsets of serially chained nine nodes from the time of grandmaster. (Filtered TOD value)

In another MATLAB simulation, the function of the TF with an FIR filter is added to the model of the network nodes. Ten network nodes are assumed to be serially chained in a master-slave relationship. (The maximum number of nodes is specified as ten in the IEEE 1588 standard [5].) Each node (except for the grandmaster) tries to synchronize its time and frequency to its prior node. The other conditions of the previous simulation are maintained. Figure 8 shows the time offsets of each node from the time of the grandmaster with an FIR LPF having a cut-off frequency of 0.028 Hz, 32 taps, and 16-bit resolution. The simulation shows that the final node has the worst-case peak-to-peak time offset of 0.21 UI.
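The filtering stage of the TF can be sketched with a generic FIR low-pass design. The following is not the filter used in the simulation: the text gives only the cut-off frequency, the tap count and the word length, so the windowed-sinc method, the Hamming window, and a sample rate of 0.5 Hz (one $\Delta_n$ value per 2-second message exchange) are assumptions, and the 16-bit coefficient quantization is not modeled. All function names are hypothetical.

```python
import math


def lowpass_taps(num_taps, cutoff_hz, sample_hz):
    """Windowed-sinc low-pass coefficients, normalized to unity DC gain."""
    fc = cutoff_hz / sample_hz                 # cut-off in cycles per sample
    mid = (num_taps - 1) / 2
    taps = []
    for i in range(num_taps):
        x = i - mid
        ideal = 2 * fc if x == 0 else math.sin(2 * math.pi * fc * x) / (math.pi * x)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * i / (num_taps - 1))  # Hamming
        taps.append(ideal * window)
    total = sum(taps)
    return [t / total for t in taps]


def fir_filter(samples, taps):
    """Direct-form FIR filter; history is pre-filled with the first sample."""
    history = [samples[0]] * len(taps)
    out = []
    for s in samples:
        history = [s] + history[:-1]           # newest sample first
        out.append(sum(h * t for h, t in zip(history, taps)))
    return out


taps = lowpass_taps(32, 0.028, 0.5)

# Hypothetical unfiltered Delta_n: a constant 1 ppm with alternating dither.
raw = [1e-6 + (2e-7 if i % 2 else -2e-7) for i in range(200)]
filtered = fir_filter(raw, taps)
settled = filtered[40:]                        # skip the start-up transient
print(max(raw) - min(raw), max(settled) - min(settled))
```

In this example the alternating component is removed almost completely while the constant part of $\Delta_n$ passes unchanged, which is the behavior the TF relies on to smooth the clock rate.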

## 5. Conclusions

We propose a new time synchronizer structure for network synchronization. Finite frequency resolution and granularity of TOD are identified as the causes of time offset fluctuation. By directly controlling the clock phase, the effective frequency resolution can be maximized and the control gain becomes deterministic. With this fixed control gain, a new approach using an external filter block is proposed and verified by MATLAB simulation. The simulation with ten serially chained nodes shows that external filtering reduces the fluctuation of time offset from 3 UI to 0.21 UI.

### Acknowledgments

This work was supported by the IT R&D program of MIC/IITA. [2007-S012-02, Multimedia Convergence Network on Chip Technology Development]

### References

1. Geoffrey M. Gardner and Kees den Hollander, "Analysis of Clock Synchronization Approaches for Residential Ethernet," *Proceedings of the 2005 Conference on IEEE 1588*, Oct. 10-12, Zurich.
2. John C. Eidson, *Measurement, Control, and Communication Using IEEE 1588*, Chapter 5: Practical Issues in Implementing IEEE 1588, Springer, 2006.
3. Do-Hwan Oh, Deok-Soo Kim, Suhwan Kim, Deog-Kyoon Jeong, and Wonchan Kim, "A 2.8-Gb/s All-Digital CDR with a 10-bit Monotonic DCO," *ISSCC Dig. Tech. Papers*, pp. 222-223, Feb. 2007.
4. Jerry Lin, "A Low-Phase-Noise 0.004-ppm/Step DCXO With Guaranteed Monotonicity in the 90-nm CMOS Process," *IEEE J. Solid-State Circuits*, pp. 2726-2734, Dec. 2005.
5. IEEE Std 1588-2002, *IEEE Standard for a Precision Clock Synchronization Protocol for Networked Measurement and Control System*, IEEE, Nov. 8, 2002.