Abstract: This paper proposes a new method for selecting the initial cluster centers and handling isolated points in the k-means clustering algorithm. The method addresses a known weakness of k-means: its high sensitivity to the choice of initial cluster centers and to texts that are isolated points. The improved algorithm is applied to Chinese text clustering. Experimental results indicate that it achieves higher accuracy and better stability than the original algorithm.