Skip to main content

History of Astronomy: Part 1

Astronomy and astrophysics have a rich history that spans thousands of years. In particular, astronomy has been practiced since ancient times, so ancient that it could have been the first scientific discipline. Ancient civilizations were keen observers of the night sky, and they developed various methods to track celestial objects. Geological evidence suggests that the first astronomical observations were made by prehistoric humans, who used simple tools to observe the stars and planets. Such evidence includes cave paintings and carvings that depict celestial objects, such as the Moon and stars. The ancient Egyptians, for example, built the pyramids in alignment with the stars, and they used the heliacal rising of Sirius to mark the start of their new year.

The motion of celestial objects was observed and recorded by ancient civilizations, leading to the development of calendars and timekeeping systems. Clocks leveraged the regular motion of celestial objects, such as the Sun and Moon, to measure time. Sundials, for example, used the position of the Sun in the sky to indicate the time of day.

There are many reasons to study the history of astronomy. The history of astronomy is intertwined with the history of physics as a whole, as the study of celestial objects has often driven advancements in our understanding of the physical world. Advances in theoretical physics, such as Kepler's laws, Newton's law of gravitation, quantum mechanics, and general relativity, have all been fully or partially motivated by the need to explain astronomical observations. As such, when we learn about the history of astronomy, we are concurrently learning about the development of Newtonian mechanics and electromagnetism; the different models of atoms; the development of quantum mechanics; and the theory of general relativity.

In this section, we will explore the history of astronomy and astrophysics, from its ancient origins to modern times. This first part will focus on the ancient and classical periods prior to Newtonian mechanics.

Table of Contents

Greek Astronomy

The modern, scientific study of astronomy began with the ancient Greeks who treated astronomy as a branch of mathematics. Due to the limited technology available at the time, the Greeks lacked sophisticated telescopes and other instruments. However, they made significant contributions to the field of astronomy through careful observations and logical reasoning.

The term planet comes from the Greek word πλανήτης (planētēs), which means "wanderer." The initial model was one in which the Earth was at the center, and all celestial objects moved in perfect circles around it. This model, known as a geocentric (geo meaning "Earth") model, was proposed by philosophers such as Eudoxus of Cnidus (c. 408-355 BCE) and later refined by Plato (c. 428-348 BCE) and Aristotle (384-322 BCE). It was the dominant model because it seemed obviously true by observation: the Sun, Moon, and stars all appeared to move in circular paths around the Earth.

As the stars moved in a circular path, their motion could be easily observed and predicted. Each star had a position that obeyed a simple equation of motion. Defining the origin of the coordinate system at the center of the Earth, the position vector evolves over time according to

where is a rotation matrix that rotates the position vector by an angle proportional to the time interval . (Obviously, this mathematical notation is anachronistic — not from the time of Plato — but it is a modern representation of the idea.)

With this model, we are able to easily predict not only the positions, but also the velocities, accelerations, energies, and other physical quantities that describe the motion of celestial objects. This allows for a complete description of the motion of celestial objects in the sky. However, the geocentric model was not without its problems.

Retrograde Motion

One of the major problems with the geocentric model was the phenomenon of retrograde motion. As a planet moves in its orbit, it appears to move in the opposite direction for a period of time. This is known as apparent retrograde motion. The most famous example of retrograde motion is that of Mars, which appears to move in the opposite direction for a few months every two years, as shown in the image below.

Retrograde motion of Mars

Apparent retrograde motion of Mars in 2003 as seen from Earth. By Eugene Alvin Villar (seav) - Own work, CC BY-SA 4.0, Link

Hipparchus (c. 190-120 BCE) proposed a solution where the planets moved in a circular path (an epicycle) around a point that itself moved in a circular path around the Earth (the deferent). In other words, the planets moved in a circular path around a point that itself moved in a circular path around the Earth. Mathematically, this looks like adding another circular motion to the original circular motion. If the original circular motion is given by , then the position vector of the planet is given by

where is a rotation matrix that rotates the position vector by an angle proportional to the time interval . These two circular motions together create a more complex motion that can explain the retrograde motion of planets.

Remark

Technically, adding on more circles allows for any motion to be described (Fourier series), but not only was this not known at the time, it also made the model more complicated than it needed to be.

Claudius Ptolemy (c. 100-170 CE) later refined this model by introducing the concept of equants (or punctum aequans). An equant is a point that is not the center of the deferent, but is used to define the motion of the planet. Specifically, the center of the epicycle (or the eccentric) moves with a constant angular velocity around the equant, instead of the center of the deferent.

This model, known as the Ptolemaic system, was able to make better predictions of the positions of planets, but it was still not perfect. For one, the model was needlessly complicated; it forsook the simplicity of uniform, circular motion for a more complex model that was not as elegant. Furthermore, in order to make the model align with new observations, new epicycles and equants had to be added, which made the model even more complicated. This led to the model becoming increasingly complex and unwieldy, with many epicycles and equants being added to explain the motion of planets.

Copernicus and the Heliocentric Model

Nicolaus Copernicus (1473-1543), a Polish mathematician and astronomer, proposed a new model of the solar system in which the Sun was at the center, and the planets moved in circular orbits around it. This model, known as the heliocentric (helio meaning "Sun") model, was a radical departure from the geocentric model. Immediately, it gave a much simpler description of the motion of planets, as it eliminated the need for epicycles and equants. This was the first heliocentric model that was supported by evidence, and it was a major breakthrough in the history of astronomy.

The problem, however, was that during his time, the Catholic Church held the geocentric model as a theological doctrine. The idea of the Earth moving around the Sun was seen as heretical, and Copernicus feared persecution for his ideas. Therefore, he published his work, "De revolutionibus orbium coelestium" ("On the Revolutions of the Celestial Spheres"), only shortly before his death in 1543. One approach at the time was to claim that the heliocentric model was just a mathematical tool, but does not reflect reality.

Mathematically, we can formulate the heliocentric model as follows. Let the origin of the coordinate system be at the center of the Sun. Then, the position vector of a planet is given by

where is a rotation matrix that rotates the position vector by an angle proportional to the time interval . The rotation of each planet can be characterized by parameters such as the orbital radius and the angular velocity.

Notably, consider that there exist planets that are closer to the Sun than the Earth, such as Venus and Mercury. Likewise, there exist planets that are farther from the Sun than the Earth, such as Mars, Jupiter, and Saturn. These are known as inferior and superior planets, respectively. This distinction leads to different observational phenomena, including the apparent retrograde motion as previously discussed.

Perhaps an animation will help illustrate how apparent retrograde motion arises naturally in the heliocentric model.

Sun reference frame

Earth reference frame

See how the line sometimes backtracks and moves backwards, indicating apparent retrograde motion. This is because as the Earth moves faster in its orbit, it overtakes Mars, causing Mars to appear to move backwards against the backdrop of distant stars. To quantify this phenomenon, we need to introduce some new definitions.

A conjunction occurs when two celestial objects have the same right ascension or ecliptic longitude, as observed from Earth. An opposition occurs when two celestial objects are on opposite sides of the sky, as observed from Earth. In other words, Mars and Earth are in conjunction when they are aligned with the Sun, and they are in opposition when they are on opposite sides of the sky. When Mars is in opposition, it is closest to Earth, and it appears brightest in the sky.

The time interval between two successive oppositions of Mars is known as the synodic period , experimentally measured to be approximately 780 days. The time interval between two successive conjunctions of Mars is known as the sidereal period . The sidereal period of Earth, , is known to be 365.256 days. Another way to interpret this is that the synodic period is the period of a planet as observed from Earth, while the sidereal period is the period of a planet as observed from the Sun.

We can relate these periods to their respective angular velocities. The angular velocity of Earth is given by , while the angular velocity of Mars is given by . The angular velocity of Mars as observed from Earth is . And since this angular velocity corresponds to the synodic period, we have .

Thus, we have

This equation allows us to calculate the sidereal period of Mars, given the synodic period and the sidereal period of Earth. Plugging in the values, we have

which gives us days.

Although the heliocentric model was a major breakthrough in the history of astronomy, it was not without its problems, and did not introduce any significant new predictive power. This is because Copernicus was still unable to forego the idea of uniform, circular motion. As such, he still had to introduce epicycles and equants to explain the motion of planets. This led to the model being only slightly simpler than the Ptolemaic system, and it was not able to make significantly better predictions of the positions of planets.

Towards Newtonian Astronomy

In order to improve the models of planetary motion, we need to abandon the idea of uniform, circular motion. One important step is to introduce a coordinate system that is able to describe the motion of planets in a reliable and consistent manner.

Altitude-Azimuth and Equatorial Coordinate Systems

The altitude-azimuth coordinate system is a spherical coordinate system that is used to describe the position of celestial objects in the sky. It is defined by two angles and centered on the observer's location on Earth. The horizon (the plane tangent to the Earth's surface at the observer's location) is the reference plane for this coordinate system. The zenith is the point directly above the observer, and the nadir is the point directly below the observer. The meridian is the great circle that passes through the zenith and the north and south points on the horizon.

The altitude angle (or elevation angle), , is the angle between the horizon and the line of sight to the celestial object. It is measured from 0° at the horizon to 90° at the zenith. Equivalently, the zenith angle, , is the angle between the zenith and the line of sight to the celestial object, measured from 0° at the zenith to 90° at the horizon. These are related by .

The azimuth angle, , is the angle between the north direction and the projection of the line of sight to the celestial object onto the horizon plane. It is measured clockwise from 0° at the north to 360°.

This system is useful for locating celestial objects in the sky, as it allows us to specify their position in terms of angles that are easy to measure. There are limitations to this system; for example, the coordinates of a celestial object will change over time as the Earth rotates and orbits the Sun. Moreover, the coordinates depend on the observer's location on Earth, and are difficult to use for comparing observations from different locations.

The Earth rotates around its axis once every day, and orbits the Sun once every year (the sidereal period). Note that the sidereal period is 365.26 days, which means that in 24 hours, the Earth orbits the Sun by about 1 degree. As such, to make the Sun cross the same meridian every day, Earth must spin about 361 degrees in 24 hours.

On the other hand, for stars further away, the effect of Earth's orbit is negligible, and they appear to cross the same meridian every 23 hours and 56 minutes (the sidereal day). This means that the stars appear to move about 1 degree every 4 minutes, or about 15 degrees every hour. This is why the stars appear to move in the sky over the course of a night.

To address the limitations of the altitude-azimuth coordinate system, astronomers developed the equatorial coordinate system. If we trace the path of the Sun in the sky (the ecliptic), we find that the declination of the Sun changes over the course of a year. (See figures 1.11 and 1.12 in Caroll and Ostlie.) The declination, , is the angle between the celestial equator and the line of sight to the celestial object. The right ascension, , is the angle between the vernal equinox and the projection of the line of sight to the celestial object onto the celestial equator. Essentially, the equatorial coordinate system is a celestial version of the latitude-longitude system used on Earth. No longer is the coordinate system centered on the observer, but rather on the celestial sphere.

This is, of course, still not perfect. As the Earth is not a perfect sphere, and it interacts gravitationally with the Moon and the Sun, the Earth's axis of rotation wobbles, or precesses, over time. This causes the coordinates of celestial objects to change over time, and the equatorial coordinate system must be updated periodically to account for this. However, this is a much smaller effect than the rotation and orbit of the Earth, and it is much easier to account for.

When we specify the coordinates of a celestial object, we must also specify the epoch, or the date at which the coordinates are valid. Today, we use the epoch J2000.0, which corresponds to January 1, 2000, at 12:00 TT (Terrestrial Time). The J stands for Julian, for the calendar introduced by Julius Caesar in 46 BCE. The coordinates of celestial objects are given in terms of right ascension and declination:

where and are constants that depend on the epoch, and and are the changes in right ascension and declination, respectively. In J2000.0, we have

where is the time in Julian centuries since J2000.0.

Another effect that must be accounted for originates from the velocities of celestial objects. We can decompose an object's velocity into two components: the radial velocity (the component along the line of sight) and the tangential velocity (the component perpendicular to the line of sight). The radial velocity can be measured using the Doppler effect, as we have previously discussed in both nonrelativistic and relativistic contexts. The tangential velocity manifests itself as a change in the equatorial coordinates of the object over time, known as proper motion. By definition,

If the radius to the object is known, then we can relate the change in distance to the change in angle. Therefore,

Finally, the proper motion is the rate of change of with respect to time:

It can be shown (see Caroll and Ostlie, section 1.3) that the proper motion leads to changes in the equatorial coordinates given by

Elliptical Orbits and Kepler's Laws

The next step in improving our models of planetary motion is to abandon the idea of circular orbits. Following the death of Copernicus (1543), Danish astronomer Tycho Brahe (1546-1601) made the most accurate observations of planetary positions up to that time. He used large, precise instruments to measure the positions of planets and stars, and he recorded his observations over a period of 20 years.

This is a tangent, but Brahe had a very interesting life. He had a pet elk, which he famously brought to social gatherings and even allowed to roam freely in his home. The elk reportedly died after drinking too much beer at a party and falling down a flight of stairs. Brahe also lost part of his nose in a duel, and he wore a prosthetic nose made of brass. He was known for his flamboyant personality and his eccentric behavior, which made him a well-known figure in his time.

Johannes Kepler, a former assistant of Brahe, later formulated the laws that describe the motion of planets based on those observations. Using Mars as a test case, Kepler found that the orbit of Mars could not be described by a circle, but rather by an ellipse. In 1609, Kepler published his first two laws of planetary motion in "Astronomia Nova" ("New Astronomy"). These laws are as follows:

  1. The orbit of a planet is an ellipse with the Sun at one of the two foci.
  2. A line segment joining a planet and the Sun sweeps out equal areas during equal time intervals.
Illustration of Kepler's second law

Illustration of Kepler's second law. Planet 1 travels through both areas and in the same time. By Hankwang - Own work, CC BY 2.5, Link

The third law was published in 1619 in "Harmonices Mundi" ("The Harmony of the World"):

  1. (The harmonic law) The orbital period of a planet, , is related to the average distance from the Sun, , by the relation . Equality occurs when is measured in years and is measured in astronomical units (AU).

An astronomical unit is defined as the average distance from the Earth to the Sun, which is approximately meters.

To describe an elliptical orbit mathematically, we need to establish some definitions. An ellipse is defined by its semimajor axis and its two focal points. The semimajor axis, , is half the length of the longest diameter of the ellipse. The two focal points, and , are located along the major axis, equidistant from the center of the ellipse. The focus in which the sun resides is denoted , and is known as the principal focus. We can then define an ellipse as the set of points that satisfy the equation

where and are the distances from a point on the ellipse to the two focal points, respectively. We can see a diagram of an ellipse below.

The eccentricity of an ellipse, , is a measure of how "stretched out" the ellipse is. It is defined as the ratio of the distance between the two focal points to the length of the major axis:

where is the distance between the two focal points. The point along the ellipse that is closest to the principal focus is known as the perihelion, and the point that is farthest from the principal focus is known as the aphelion. The precession of Mars's perihelion was one of the first tests of Einstein's general relativity, as we previously discussed in the general relativity chapter.

At the ends of the semiminor axis, we have , and at the ends of the semimajor axis, we have and (or vice versa). Pythagoras's theorem tells us that , so

Also, from Pythagoras's theorem on the triangle formed by and the point on the ellipse, we have

Substituting (from the definition of an ellipse) and rearranging, we have

This allows for a description of the orbit of a planet in polar coordinates, with the principal focus at the origin. Note that the exact motion of this idealized planet must be approximated using numerical methods. This is because the equation

where is the mean anomaly, is the eccentric anomaly, and is the eccentricity, cannot be solved for in terms of elementary functions. The mean anomaly is defined as

where is the orbital period, is the time, and is the time at which the planet is at perihelion. With a value for , the orbit can be described using the parametric equations

The area swept out by the line segment joining the planet and the Sun can be calculated using the formula for the area of an ellipse, . The area swept out in a time interval is given by

Lastly, different ranges of eccentricity correspond to different conic sections:

  • : Ellipse (closed orbit)
  • : Parabola (open orbit)
  • : Hyperbola (open orbit)

These shapes all have one focus, which is where the central body (e.g., the Sun) is located. We have used all of these conic sections before. Parabolas occur in projectile motion when air resistance is negligible, and hyperbolas occur in the trajectories of some comets. In addition, as the Minkowski metric has a hyperbolic geometry, hyperbolas are fundamental to special relativity.

Summary and Next Steps

In this chapter, we have discussed the history of astronomy and the development of models to describe the motion of celestial objects. We have seen how the geocentric model was replaced by the heliocentric model, and how the idea of uniform, circular motion was abandoned in favor of elliptical orbits.

Here are the key takeaways from this section, labeled based on who discovered them:

  • Eudoxus of Cnidus, Aristotle et al.: Proposed the geocentric model of the universe, with Earth at the center and celestial objects moving in circular orbits around it.
  • Claudius Ptolemy: Refined the geocentric model by introducing epicycles and equants to explain the motion of planets.
  • Nicolaus Copernicus: Proposed the heliocentric model, with the Sun at the center and planets moving in circular orbits around it.
  • Tycho Brahe: Made the most accurate observations of planetary positions up to that time.
  • Johannes Kepler: Formulated the three laws of planetary motion, describing the motion of planets in elliptical orbits around the Sun.

All of these discoveries were prior to the development of Newtonian mechanics, and notably lacked a physical explanation for the motion of celestial objects. In the next chapter, we will discuss how Newton's laws of motion and his law of universal gravitation provided a physical explanation for the motion of celestial objects, and how they were able to explain the observations made by Brahe and the laws formulated by Kepler.