The LSHK word list was compiled by The Jyutping Work Group. The words in the list come from various sources: Andy Chin's Mid-20th Century Hong Kong Cantonese Corpus, Shin Kataoka and Cream Lee's teaching materials, and the Education Bureau's word list for elementary schools.
Members of the Jyutping Word List Group:
Cheung Kwan Hin (Convenor)
Tang Sze Wing
The LSHK word list contains a generous 15,740 entries that may be searched by various means. I looked up one of my favorite Cantonese expressions, 哎吔, and found four pronunciations, all from the LSHK database: ai1jaa1, ai3jaa4, ai1jaa3, ai2jaa5. In Sheik's CantoDict, 哎吔 has the Jyutping romanization ai1jaa1 and is defined as meaning "Aiya!; a sort of [as in ai1jaa1 lou5dau6 哎吔老豆 ("some kind of daddy")]."
I particularly like the fact that the syllables of the words in the LSHK list are joined together (ai1jaa1) and not divided (ai1 jaa1) or separated by a hyphen (ai1-jaa1), as is the custom on most sites.
As I was preparing this post, I made the serendipitous discovery that the Sheik site has a very valuable parser. You can enter a Chinese text of up to 250 characters into the parser, which "attempts to split it up into component Chinese words. You can then see the meanings of each character and compound word by moving your mouse over the parsed text (we recommend using the Firefox web browser)." Fortunately, I have always and only used Firefox, so the parser worked very well for me.
I put the following two passages, chosen at random from the Mandarin and Cantonese versions of the Wikipedia article on Hong Kong, into the Sheik parser, and I must say that the results were very helpful. Give it a whirl yourself!